voice-agent

Installation
SKILL.md

Voice Agent

This skill allows you to speak and listen to the user using the local Voice Agent API.

Behavior Guidelines

  • Audio First: When the user communicates via audio (files), your PRIMARY mode of response is Audio File.
  • Silent Delivery: When sending an audio response, DO NOT send a text explanation like "I sent an audio". Just send the audio file.
  • Workflow:
    1. User sends audio.
    2. You usage transcribe to read it.
    3. You think of a response.
    4. You usage synthesize to generate the audio file.
    5. You send the file.
    6. STOP. Do not add text commentary.

Tools

Transcribe File

To transcribe an audio file (Speech-to-Text), run the client script with the transcribe command.

Related skills
Installs
3
GitHub Stars
2
First Seen
Mar 1, 2026