deepgram-python-text-to-speech
Installation
SKILL.md
Using Deepgram Text-to-Speech (Python SDK)
Convert text to audio: one-shot REST download or low-latency streaming synthesis via /v1/speak.
When to use this product
- REST (
speak.v1.audio.generate) — one-shot synthesis, returns audio bytes. Use for rendered files, pre-generated prompts, anything where you have the full text upfront. - WebSocket (
speak.v1.connect) — incremental text input, streaming audio output. Use for low-latency playback while an LLM is still producing tokens.
Use a different skill when:
- You need the agent to also listen and converse (full-duplex) →
deepgram-python-voice-agent.
Authentication
from dotenv import load_dotenv
load_dotenv()