AudioMind
Installation
SKILL.md
🎙️ AudioMind
Use when: User asks to generate speech, narrate text, create a voice-over, compose music, or produce a sound effect.
AudioMind is a smart audio dispatcher. It analyzes your request and routes it to the best available model — ElevenLabs for speech and music, fal.ai for fast SFX — and returns a ready-to-use audio URL.
Quick Reference
| Request Type | Best Model | Latency |
|---|---|---|
| Narrate text / Voice-over | elevenlabs-tts-v3 |
~3s |
| Low-latency TTS (real-time) | elevenlabs-tts-turbo |
<1s |
| Background music | cassetteai-music |
~15s |
| Sound effect | elevenlabs-sfx |
~5s |
| Clone a voice from audio | elevenlabs-voice-clone |
~10s |
Related skills