mm-voice-maker


MiniMax Voice Maker

Professional text-to-speech skill with emotion detection, voice cloning, and audio processing, powered by the MiniMax Voice API and FFmpeg.

Capabilities

| Area | Features |
|------|----------|
| TTS | Sync (HTTP/WebSocket), async (long text), streaming |
| Segment-based | Multi-voice, multi-emotion synthesis from segments.json, auto merge |
| Voice | Cloning (10s–5min), design (text prompt), management |
| Audio | Format conversion, merge, normalize, trim, remove silence (FFmpeg) |
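To make the Audio row more concrete, here is a minimal sketch of how each listed operation maps onto a standard FFmpeg invocation. It only builds argument lists and does not run anything. The helper names (`convert_cmd`, `normalize_cmd`, `trim_cmd`, `remove_silence_cmd`, `concat_list_text`, `merge_cmd`) and the default threshold are hypothetical and chosen for illustration; they are not taken from `mmvoice.py`, whose actual interface may differ.

```python
from __future__ import annotations


def convert_cmd(src: str, dst: str, sample_rate: int = 44100) -> list[str]:
    """Format conversion: ffmpeg picks the output codec from dst's extension."""
    return ["ffmpeg", "-y", "-i", src, "-ar", str(sample_rate), dst]


def normalize_cmd(src: str, dst: str) -> list[str]:
    """Loudness normalization using ffmpeg's EBU R128 `loudnorm` filter."""
    return ["ffmpeg", "-y", "-i", src, "-af", "loudnorm", dst]


def trim_cmd(src: str, dst: str, start: float, duration: float) -> list[str]:
    """Keep `duration` seconds starting at `start` seconds."""
    return ["ffmpeg", "-y", "-ss", str(start), "-t", str(duration), "-i", src, dst]


def remove_silence_cmd(src: str, dst: str, threshold_db: int = -50) -> list[str]:
    """Strip leading silence with the `silenceremove` filter.

    The -50 dB default is an assumed value, not one documented by the skill.
    """
    flt = f"silenceremove=start_periods=1:start_threshold={threshold_db}dB"
    return ["ffmpeg", "-y", "-i", src, "-af", flt, dst]


def concat_list_text(paths: list[str]) -> str:
    """Contents of the list file read by ffmpeg's concat demuxer."""
    return "".join(f"file '{p}'\n" for p in paths)


def merge_cmd(list_file: str, dst: str) -> list[str]:
    """Merge the files named in list_file without re-encoding.

    `-c copy` only works when all inputs share the same codec and parameters.
    """
    return ["ffmpeg", "-y", "-f", "concat", "-safe", "0",
            "-i", list_file, "-c", "copy", dst]


if __name__ == "__main__":
    print(" ".join(normalize_cmd("in.mp3", "out.mp3")))
    print(concat_list_text(["a.mp3", "b.mp3"]), end="")
```

Each list can be passed to `subprocess.run(cmd, check=True)` on a machine where `ffmpeg` is on the `PATH`. Building the arguments as lists rather than shell strings avoids quoting problems with file names that contain spaces.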

File structure:

mmVoice_Maker/
├── SKILL.md                       # This overview
├── mmvoice.py                     # CLI tool (recommended for Agents)
First Seen
Mar 17, 2026