truefoundry-llm-deploy
Routing note: For ambiguous user intents, use the shared clarification templates in references/intent-clarification.md.
LLM / Model Deployment
Deploy large language models and ML inference servers to TrueFoundry. Supports vLLM, TGI, and custom model servers with proper GPU allocation, model caching, health probes, and production-ready defaults.
Two paths:
- CLI (`tfy apply`) -- Write a YAML manifest and apply it. Works everywhere.
- REST API (fallback) -- When the CLI is unavailable, use `tfy-api.sh`.
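
The choice between the two paths can be sketched as a small helper. This is a minimal illustration, not part of the skill itself: the function name `choose_deploy_path` is hypothetical, the `-f` flag on `tfy apply` is an assumption, and the arguments that `tfy-api.sh` expects are not described here, so the fallback branch returns only the script name.

```python
import shutil


def choose_deploy_path(manifest_path, which=shutil.which):
    """Return the command for the preferred deployment path as an argv list.

    `which` is injectable so the lookup can be replaced in tests.
    """
    if which("tfy") is not None:
        # CLI path. Assumption: the manifest is passed with -f.
        return ["tfy", "apply", "-f", manifest_path]
    # REST API fallback. The script's arguments are not specified in this
    # document, so only the script name is returned.
    return ["tfy-api.sh"]
```

Preferring the CLI whenever the `tfy` binary is on `PATH` mirrors the order of the list above, with the REST API used only as a fallback.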