voice-ai-integration

Installation
SKILL.md

Voice AI Integration

Build intelligent voice-enabled AI applications that understand spoken language and respond naturally through audio, creating seamless voice-first user experiences.

Overview

Voice AI systems combine three key capabilities:

  1. Speech Recognition - Convert audio input to text
  2. Natural Language Processing - Understand intent and context
  3. Text-to-Speech - Generate natural-sounding responses

Speech Recognition Providers

See examples/speech_recognition_providers.py for implementations:

  • Google Cloud Speech-to-Text: High accuracy with automatic punctuation
  • OpenAI Whisper: Robust multilingual speech recognition
  • Azure Speech Services: Enterprise-grade speech recognition
  • AssemblyAI: Async processing with high accuracy
Related skills
Installs
133
GitHub Stars
13
First Seen
Jan 22, 2026