voice-generation

Voice Generation Skill

Generate realistic speech with Google Gemini TTS, ElevenLabs, or OpenAI TTS.

Prerequisites

At least one API key is required:

  • GOOGLE_API_KEY - For Google Gemini TTS (same key as video/image/music) ✅
  • ELEVENLABS_API_KEY - For ElevenLabs high-quality voice synthesis
  • OPENAI_API_KEY - For OpenAI TTS voices
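
As a rough illustration of the "at least one key" rule, the sketch below checks which of the three environment variables are set and picks the first available provider. The helper names (`available_providers`, `pick_provider`) and the preference order are assumptions made for this example; they are not part of the skill itself.

```python
import os

# Environment variable -> provider label, in the order listed above.
# The preference order is an assumption for illustration only.
PROVIDER_KEYS = {
    "GOOGLE_API_KEY": "Google Gemini TTS",
    "ELEVENLABS_API_KEY": "ElevenLabs",
    "OPENAI_API_KEY": "OpenAI TTS",
}


def available_providers(env=None):
    """Return the providers whose API key is present and non-empty."""
    env = os.environ if env is None else env
    return [name for key, name in PROVIDER_KEYS.items() if env.get(key)]


def pick_provider(env=None):
    """Return the first configured provider, or fail with a clear message."""
    providers = available_providers(env)
    if not providers:
        raise RuntimeError(
            "Set at least one of: " + ", ".join(PROVIDER_KEYS)
        )
    return providers[0]
```

Calling `pick_provider()` with no arguments reads from `os.environ`; passing a plain dict makes the check easy to exercise without touching the real environment.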

Available APIs

Google Gemini TTS (Recommended - Same API Key)

  • Best for: Podcasts, dialogues, audiobooks with style control
  • Voices: 30 voices with natural language style control
  • Multi-speaker: Up to 2 speakers for dialogues ✅
  • Languages: 24 languages (auto-detected)
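
The two-speaker limit can be checked before any request is sent. The sketch below is a hypothetical helper, not the skill's own code: it assembles dialogue turns into a plain transcript and rejects scripts with more than two distinct speakers. The `Speaker: line` transcript layout and the name `build_dialogue` are assumptions.

```python
MAX_SPEAKERS = 2  # multi-speaker limit stated in the list above


def build_dialogue(turns):
    """Join (speaker, line) pairs into a transcript, one turn per line.

    Raises ValueError if the script uses more than MAX_SPEAKERS voices.
    """
    speakers = []
    for speaker, _ in turns:
        if speaker not in speakers:
            speakers.append(speaker)
    if len(speakers) > MAX_SPEAKERS:
        raise ValueError(
            f"{len(speakers)} speakers given; at most {MAX_SPEAKERS} are supported"
        )
    return "\n".join(f"{speaker}: {line}" for speaker, line in turns)


script = build_dialogue([
    ("Host", "Welcome back to the show."),
    ("Guest", "Thanks for having me."),
    ("Host", "Let's get started."),
])
```

Validating locally keeps a three-voice script from failing only after it reaches the API.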
First Seen
Jan 28, 2026