voice-generation

Installation

SKILL.md

Voice Generation Skill

Generate realistic speech using AI (Google Gemini TTS, ElevenLabs, OpenAI TTS).

Prerequisites

At least one API key is required:

GOOGLE_API_KEY - For Google Gemini TTS (same key as video/image/music) ✅
ELEVENLABS_API_KEY - For ElevenLabs high-quality voice synthesis
OPENAI_API_KEY - For OpenAI TTS voices

Available APIs

Google Gemini TTS (Recommended - Same API Key)

Best for: Podcasts, dialogues, audiobooks with style control
Voices: 30 voices with natural language style control
Multi-speaker: Up to 2 speakers for dialogues ✅
Languages: 24 languages (auto-detected)

Related skills

More from michaelboeding/skills

Installs

18

Repository

michaelboeding/skills

GitHub Stars

12

First Seen

Jan 28, 2026

Security Audits

Gen Agent Trust HubPass