fal-serverless-guide
Installation
SKILL.md
Quick Reference
| Machine Type | GPU | VRAM | Use Case |
|---|---|---|---|
GPU-T4 |
T4 | 16GB | Dev, small models |
GPU-A10G |
A10G | 24GB | 7B-13B models |
GPU-A100 |
A100 | 40/80GB | 13B-70B models |
GPU-H100 |
H100 | 80GB | Cutting-edge |
| App Attribute | Purpose | Example |
|---|---|---|
machine_type |
GPU selection | "GPU-A100" |
requirements |
Dependencies | ["torch", "transformers"] |
keep_alive |
Warm duration | 300 (5 min) |
min_concurrency |
Min instances | 0 (scale to zero) |
max_concurrency |
Max parallel | 4 |