audio-language-models

Audio Language Models

Build real-time voice agents and audio-processing pipelines using native speech-to-speech models.

Overview

  • Real-time voice assistants and agents
  • Live conversational AI (phone agents, support bots)
  • Audio transcription with speaker diarization
  • Multilingual voice interactions
  • Text-to-speech generation
  • Voice-to-voice translation

Model Comparison (January )

Real-Time Voice (Speech-to-Speech)

Model | Latency | Languages | Price | Best For
More from yonatangross/orchestkit

First Seen
Jan 22, 2026