runpod-serverless-builder
RunPod Serverless Builder
Build end-to-end RunPod serverless endpoints optimized for extremely short cold start times.
Capabilities
Create production-ready RunPod serverless workers for:
- vLLM - High-performance LLM inference
- ComfyUI - Image/video generation with workflow support
- Custom Inference - User-provided Python inference code
Loading Strategies:
- Baked Models: Models embedded in Docker image for fastest cold starts (<5s)
- Dynamic Loading: Models loaded from network storage at runtime (shared across workers)
Quick Start
Use the interactive project generator:
More from avivk5498/my-claude-code-skills
ceo-companion
Collaborative CEO co-pilot for SaaS strategy sessions. Researches markets, validates ideas, designs UI inspiration boards, and produces a .strategy/ folder that Beads Orchestration consumes for autonomous building. Use as Session 1 before a Beads build session.
17create-beads-orchestration
Bootstrap lean multi-agent orchestration with beads task tracking. Use for projects needing agent delegation without heavy MCP overhead.
9agent-debugger
Systematic debugging toolkit for AI agentic workflows in customer support. Use when diagnosing issues with AI agents including wrong responses, tool/function calling problems, conversation loops, stuck states, or performance/latency issues. Works with any framework (LangChain, custom agents, Claude API) and accepts conversation logs, API logs, tool execution logs, and agent configurations.
6agentform
|
5