Using Deepgram Text-to-Speech (Python SDK)

Convert text to audio: one-shot REST download or low-latency streaming synthesis via /v1/speak.

When to use this product

REST (speak.v1.audio.generate) — one-shot synthesis, returns audio bytes. Use for rendered files, pre-generated prompts, anything where you have the full text upfront.
WebSocket (speak.v1.connect) — incremental text input, streaming audio output. Use for low-latency playback while an LLM is still producing tokens.

Use a different skill when:

You need the agent to also listen and converse (full-duplex) → deepgram-python-voice-agent.

from dotenv import load_dotenv
load_dotenv()

Installs

Repository

GitHub Stars

447

First Seen

May 12, 2026

Security Audits