ai-voice-cloning
Installation
Summary
Natural AI voice generation across seven models with 22+ voices, multiple languages, and emotional range.
- Supports ElevenLabs (premium quality, 32 languages), Kokoro TTS, DIA, Chatterbox, Higgs, and VibeVoice, each optimized for different styles from professional narration to casual conversation
- Includes 16+ named voices with gender and style profiles (e.g., warm, authoritative, youthful) plus speed control (0.8–1.2x) and punctuation-based pacing
- Handles multi-voice conversations, long-form content chunking with crossfade merging, and video integration for voiceovers and talking-head avatars
- Accessed via
infshCLI with straightforward JSON input; works for voiceovers, audiobooks, podcasts, e-learning, accessibility, and IVR systems
SKILL.md
Install the belt CLI skill:
npx skills add belt-sh/cli
AI Voice Generation
Generate natural AI voices via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
Related skills