ai-podcast-creation
Multi-voice podcast and audiobook production with TTS, AI music, and audio merging.
- Supports three TTS engines (Kokoro, DIA, Chatterbox) with 6+ voice options across American, British, and conversational styles
- Includes AI music generation for intros, outros, and background tracks; media merger handles crossfades and layering
- Workflows cover single narration, multi-voice dialogue, full episode pipelines, and NotebookLM-style document discussions
- Integrates with Claude for script generation and supports audiobook chapters with adjustable speech speed
Install the belt CLI skill:
npx skills add belt-sh/cli
AI Podcast Creation
Create AI-powered podcasts and audio content via inference.sh CLI.

Quick Start
Requires inference.sh CLI (
belt). Install instructions
belt login
More from inferen-sh/skills
elevenlabs-dubbing
0qwen-image-2-pro
Generate images with Alibaba Qwen-Image-2.0-Pro via inference.sh CLI. Professional text rendering, fine-grained realism, enhanced semantic adherence. Ideal for posters, banners, and text-heavy designs. Triggers: qwen image pro, qwen-image-pro, qwen 2 pro, alibaba image pro, dashscope pro, professional text rendering
0twitter-thread-creation
0text-to-speech
Convert text to natural speech with Inworld TTS, ElevenLabs, DIA TTS, Kokoro, Chatterbox, and more via inference.sh CLI. Models: Inworld TTS-2 (100+ languages, emotion steering), Inworld TTS 1.5 (ultra-low latency), ElevenLabs (premium, 22+ voices, 32 languages), DIA TTS (conversational), Kokoro TTS, Chatterbox, Higgs Audio, VibeVoice (podcasts). Capabilities: text-to-speech, voice cloning, multi-speaker dialogue, podcast generation, expressive speech, emotion/delivery steering, character voices. Use for: voiceovers, audiobooks, podcasts, accessibility, video narration, IVR, voice assistants, gaming characters, avatar audio. Triggers: text to speech, tts, voice generation, ai voice, speech synthesis, voice over, generate speech, ai narrator, voice cloning, text to audio, elevenlabs, eleven labs, voice ai, ai voiceover, speech generator, natural voice, inworld, inworld tts, character voice, game voice, npc voice
0ai-music-generation
0gpt-image
0