deepgram-rate-limits

Installation
SKILL.md

Deepgram Rate Limits

Overview

Implement rate limiting, exponential backoff, and circuit breaker patterns for Deepgram API. Deepgram limits by concurrent connections (not requests per second). Understanding this model is key to building reliable integrations.

Deepgram Rate Limit Model

Deepgram uses concurrency-based limits, not traditional requests-per-minute:

Plan Concurrent Requests (STT) Concurrent Connections (Live) Concurrent Requests (TTS)
Pay As You Go 100 100 100
Growth 200 200 200
Enterprise Custom Custom Custom

When you exceed your concurrency limit, Deepgram returns 429 Too Many Requests.

Key insight: You can send unlimited total requests — just not more than your concurrency limit simultaneously.

Instructions

Related skills
Installs
23
GitHub Stars
2.2K
First Seen
Feb 18, 2026