🎙️ AudioMind

Use when: User asks to generate speech, narrate text, create a voice-over, compose music, or produce a sound effect.

AudioMind is a smart audio dispatcher. It analyzes your request and routes it to the best available model — ElevenLabs for speech and music, fal.ai for fast SFX — and returns a ready-to-use audio URL.

Quick Reference

Request Type	Best Model	Latency
Narrate text / Voice-over	`elevenlabs-tts-v3`	~3s
Low-latency TTS (real-time)	`elevenlabs-tts-turbo`	<1s
Background music	`cassetteai-music`	~15s
Sound effect	`elevenlabs-sfx`	~5s
Clone a voice from audio	`elevenlabs-voice-clone`	~10s

Installs

Repository

wells1137/media-skills

GitHub Stars

First Seen

Mar 4, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykPass