tts-voiceover
Installation
SKILL.md
TTS Voice Over Skill
Generates per-slide WAV voice-over files from YAML speaker_notes using Azure Speech SDK with SSML pronunciation control.
Overview
This skill reads content.yaml files from a PowerPoint skill content directory, extracts speaker_notes fields, applies SSML acronym aliases for correct pronunciation of technical terms, and produces one WAV file per slide. Supports dry-run mode for SSML template verification without Azure credentials.
Prerequisites
- Azure Speech resource — Free tier provides 500K characters per month.
- Authentication — Key-based (
SPEECH_KEY) or Microsoft Entra ID (SPEECH_RESOURCE_ID). - Python 3.11+ with
uvfor virtual environment management.