Vram-GPU-OOM
Installation
SKILL.md
GPU OOM Retry Pattern
Simple pattern for sharing GPU memory across multiple services without coordination.
Strategy
- All services try to load models normally
- Catch OOM errors
- Wait 30-60 seconds (for other services to auto-unload)
- Retry up to 3 times
- Configure all services to unload quickly when idle
Python (PyTorch / Transformers)
import torch
import time
Related skills
More from lawless-m/claude-skills
rust
Rust development patterns, project setup, CLI/TUI applications, error handling, and system integration
1browserbridge
Real-time browser debugging and interaction via WebSocket bridge server on localhost:3141
1web-frontend
Suite of tools for creating elaborate, multi-component claude.ai HTML artifacts using modern frontend web technologies (React, Tailwind CSS, shadcn/ui). Use for complex artifacts requiring state management, routing, or shadcn/ui components - not for simple single-file HTML/JSX artifacts.
1postgresql
PostgreSQL database operations using PgQuery tool for DDL execution, schema management, and query operations
1