Groq Multi-Environment Setup
Overview
Configure Groq across environments with the right balance of cost, speed, and capability per tier. Groq's key differentiator is inference speed (100-300 tokens/second), but rate limits differ dramatically by plan: the free tier allows 30 requests per minute (RPM) and 14,400 requests per day (RPD) for llama-3.1-70b, while the paid tier removes most limits.
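To make the free-tier figures above concrete, the sketch below converts them into pacing budgets. It uses only the RPM and RPD numbers quoted in this section; the `RateLimit` type and `pacing` helper are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RateLimit:
    rpm: int  # requests per minute
    rpd: int  # requests per day


def pacing(limit: RateLimit) -> dict:
    """Derive simple pacing numbers from per-minute and per-day caps."""
    minutes_per_day = 24 * 60
    return {
        # Minimum gap between requests that stays under the per-minute cap.
        "min_interval_s": 60 / limit.rpm,
        # Average per-minute rate that spends the daily cap evenly over 24 hours.
        "sustained_rpm": limit.rpd / minutes_per_day,
        # Minutes of traffic at the full per-minute rate before the daily cap is hit.
        "minutes_at_full_rpm": limit.rpd / limit.rpm,
    }


# Free-tier figures quoted above for llama-3.1-70b.
free_70b = RateLimit(rpm=30, rpd=14_400)
budget = pacing(free_70b)
print(budget)
```

Under these limits a client can send one request every 2 seconds, but only 480 minutes (8 hours) of that pace fits in the daily quota. Spread evenly across a day, the daily cap works out to 10 requests per minute.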
Prerequisites
- Groq API key(s) per environment from console.groq.com
- Environment variable management (.env.local, GitHub Secrets, or cloud secret manager)
- Understanding of Groq's model tiers and rate limits
Environment Strategy
| Environment | Model | Rate Limit Risk | Config Source |
|---|---|---|---|
| Development | llama-3.1-8b-instant | Low (small model) | .env.local |
| Staging | llama-3.1-70b-versatile | Medium | CI/CD secrets |
| Production | llama-3.1-70b-versatile or llama-3.3-70b-specdec | Managed with retry | Secret manager |
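One way to encode the table above is a small lookup keyed by environment name. This is a sketch under stated assumptions: only the model IDs come from the table. The `APP_ENV` variable, the `EnvConfig` type, the retry counts, and the `backoff_delay` helper are illustrative choices, and `GROQ_API_KEY` is assumed as the key's variable name. Production is pinned to the first of the two models listed.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class EnvConfig:
    model: str
    max_retries: int  # assumed values; tune per workload


# Model IDs are taken from the table; retry counts are illustrative.
CONFIGS = {
    "development": EnvConfig("llama-3.1-8b-instant", max_retries=0),
    "staging": EnvConfig("llama-3.1-70b-versatile", max_retries=2),
    "production": EnvConfig("llama-3.1-70b-versatile", max_retries=5),
}


def load_config(environ=os.environ):
    """Return (config, api_key) for the environment named in APP_ENV."""
    env = environ.get("APP_ENV", "development")  # hypothetical variable name
    if env not in CONFIGS:
        raise ValueError(f"Unknown environment: {env!r}")
    api_key = environ.get("GROQ_API_KEY")  # assumed variable name
    if not api_key:
        raise RuntimeError(f"GROQ_API_KEY is not set for {env}")
    return CONFIGS[env], api_key


def backoff_delay(attempt: int, base: float = 1.0, cap: float = 30.0) -> float:
    """Exponential backoff delay in seconds for a 0-based retry attempt."""
    return min(cap, base * 2 ** attempt)


# Usage with an explicit mapping instead of the real process environment;
# the key value is a placeholder, not a real credential.
config, key = load_config({"APP_ENV": "staging", "GROQ_API_KEY": "placeholder-key"})
print(config.model, config.max_retries)
```

Failing fast on a missing key or an unknown environment name keeps a misconfigured deployment from silently falling back to another tier's model or credentials.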
Instructions