qwencloud-audio-tts
SKILL.md
Agent setup: If your agent doesn't auto-load skills (e.g. Claude Code), see agent-compatibility.md once per session.
Qwen Audio TTS (Text-to-Speech)
Synthesize natural speech from text using Qwen TTS models. This skill is part of qwencloud/qwencloud-ai.
Skill directory
Use this skill's internal files to execute and learn. Load reference files on demand when the default path fails or you need details.
| Location | Purpose |
|---|---|
scripts/tts.py |
Qwen TTS (HTTP API) — qwen3-tts-flash, qwen3-tts-instruct-flash |
scripts/tts_cosyvoice.py |
CosyVoice (WebSocket API) — requires dashscope SDK |
references/cosyvoice-guide.md |
CosyVoice setup, voices, examples, errors |
references/execution-guide.md |
Fallback: curl (standard, instruct, streaming), code generation |
references/prompt-guide.md |
Text formatting for speech, instructions templates, voice selection |