ai-ml-infra

Installation
SKILL.md

AI/ML Infrastructure

Model serving with KubeAI, GPU scheduling, and inference patterns.

Model Deployment Options

Feature KubeAI Ollama Operator LlamaStack
Backend vLLM (GPU optimized) Ollama (easy) Multi-backend
Scale from zero Yes No No
OpenAI API Native Compatible Compatible
Best for Production GPU CPU/mixed Full AI stack

KubeAI Setup

Model CRD

Installs
3
Repository
5dlabs/cto
First Seen
Jan 24, 2026
ai-ml-infra — 5dlabs/cto