tts
Installation
SKILL.md
Vox TTS / STT (Apple Silicon)
Local text-to-speech, speech-to-text, and voice cloning powered by Qwen3-TTS/ASR + MLX on Apple Silicon.
Description
A powerful local TTS/STT skill based on Vox CLI that runs entirely on Apple Silicon Macs. Features speech synthesis with preset and custom voices, voice cloning from audio samples, speech recognition with subtitle generation, and batch processing. All models run locally via MLX - your data never leaves your machine.
When to Use
Use this skill when users:
- Want to convert text to speech, "read this aloud", "generate audio from text"
- Ask for "TTS", "text to speech", "语音合成", "文字转语音", "朗读"
- Want to transcribe audio to text, "STT", "speech to text", "语音识别", "转录"
- Need voice cloning from a sample, "clone this voice", "克隆声音"
- Want to generate subtitles (SRT/VTT) from audio
- Ask for batch text-to-speech conversion
- Mention "vox" or "Qwen3-TTS"