ai-ml-infra
Installation
SKILL.md
AI/ML Infrastructure
Model serving with KubeAI, GPU scheduling, and inference patterns.
Model Deployment Options
| Feature | KubeAI | Ollama Operator | LlamaStack |
|---|---|---|---|
| Backend | vLLM (GPU optimized) | Ollama (easy) | Multi-backend |
| Scale from zero | Yes | No | No |
| OpenAI API | Native | Compatible | Compatible |
| Best for | Production GPU | CPU/mixed | Full AI stack |