Deepgram Performance Tuning

Overview

Optimize Deepgram transcription performance through audio preprocessing with ffmpeg, model selection for speed vs accuracy, streaming for large files, parallel processing, result caching, and connection reuse. Targets: <2s latency for short files, 100+ files/minute batch throughput.

Performance Levers

Factor	Impact	Default	Optimized
Audio format	High	Any format	16kHz mono WAV
Model	High	nova-3	base (speed) or nova-3 (accuracy)
File size	High	Full file sync	Stream >60s, callback >5min
Concurrency	Medium	Sequential	50 parallel (p-limit)
Caching	Medium	None	Redis hash by audio+options
Features	Medium	All enabled	Disable unused (diarize, utterances)

Instructions

Step 1: Audio Preprocessing with ffmpeg

Installs

Repository

jeremylongshore…s-skills

GitHub Stars

2.5K

First Seen

Feb 18, 2026

Security Audits

Gen Agent Trust HubWarn

SocketPass

SnykWarn