deepgram-python-speech-to-text
Using Deepgram Speech-to-Text (Python SDK)
Basic transcription (ASR) for prerecorded audio (REST) or live audio (WebSocket) via /v1/listen.
When to use this product
- REST (`transcribe_url` / `transcribe_file`): one-shot transcription of a complete file or URL. Use for batch jobs, captioning pipelines, and offline analysis.
- WebSocket (`listen.v1.connect`): continuous streaming transcription. Use for live captions, real-time microphone input, and phone audio.
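Both transports target the same `/v1/listen` path and differ only in scheme and in how the audio is delivered. The sketch below uses the standard library instead of the SDK to show this at the wire level. The helper names `listen_url` and `build_prerecorded_request` are hypothetical, and the host `api.deepgram.com`, the `Token` authorization scheme, and the `nova-3` default model are assumptions based on Deepgram's hosted API. Check them against your account and deployment.

```python
import json
import urllib.parse
import urllib.request

API_HOST = "api.deepgram.com"  # assumption: hosted API; self-hosted setups differ
LISTEN_PATH = "/v1/listen"


def listen_url(transport: str, **params: str) -> str:
    """Return the /v1/listen URL for 'rest' (https) or 'websocket' (wss)."""
    schemes = {"rest": "https", "websocket": "wss"}
    if transport not in schemes:
        raise ValueError(f"unknown transport: {transport!r}")
    query = urllib.parse.urlencode(params)
    return urllib.parse.urlunsplit(
        (schemes[transport], API_HOST, LISTEN_PATH, query, "")
    )


def build_prerecorded_request(
    api_key: str, audio_url: str, model: str = "nova-3"
) -> urllib.request.Request:
    """Build (but do not send) a one-shot transcription request for a hosted file."""
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        listen_url("rest", model=model),
        data=body,
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` with a valid key would send the request. In practice the SDK methods listed above are the more convenient route, since they also handle file uploads and the streaming connection.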
Use a different skill when:
- You want summaries, sentiment, topics, intents, diarization, or redaction on the audio → `deepgram-python-audio-intelligence` (same endpoint, different params).
- You need turn-taking / end-of-turn events → `deepgram-python-conversational-stt` (v2 / Flux).
- You need a full-duplex interactive assistant (STT + LLM + TTS + function calls) → `deepgram-python-voice-agent`.
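The routing rules above can be condensed into a small lookup, which may help when an agent or script has to choose a skill programmatically. This is only an illustrative sketch: the need labels (`"summaries"`, `"end-of-turn"`, and so on) and the function `pick_skill` are hypothetical, and only the skill names come from this document.

```python
# Hypothetical need labels mapped to the skill names listed above.
SKILL_FOR_NEED = {
    "summaries": "deepgram-python-audio-intelligence",
    "sentiment": "deepgram-python-audio-intelligence",
    "topics": "deepgram-python-audio-intelligence",
    "intents": "deepgram-python-audio-intelligence",
    "diarization": "deepgram-python-audio-intelligence",
    "redaction": "deepgram-python-audio-intelligence",
    "turn-taking": "deepgram-python-conversational-stt",
    "end-of-turn": "deepgram-python-conversational-stt",
    "voice-assistant": "deepgram-python-voice-agent",
}

DEFAULT_SKILL = "deepgram-python-speech-to-text"


def pick_skill(need: str) -> str:
    """Return the skill for a need, falling back to basic speech-to-text."""
    return SKILL_FOR_NEED.get(need.strip().lower(), DEFAULT_SKILL)
```

Anything not in the table, such as plain batch or live transcription, falls through to this skill.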