alicloud-ai-audio-tts-voice-clone
Voice cloning and text-to-speech synthesis using Alibaba Cloud Qwen TTS VC models.
- Supports two model variants: standard batch processing (
qwen3-tts-vc-2026-01-22) and real-time streaming (qwen3-tts-vc-realtime-2026-01-15) - Accepts voice samples as file paths or raw bytes; generates cloned voice IDs for reuse across multiple synthesis requests
- Normalized interface handles text input, voice enrollment, optional streaming output, and returns audio URLs or PCM chunks
- Requires
DASHSCOPE_API_KEYenvironment variable and thedashscopePython SDK; includes validation script and local helper for request preparation
Category: provider
Model Studio Qwen TTS Voice Clone
Use voice cloning models to replicate timbre from enrollment audio samples.
Critical model names
Use one of these exact model strings:
qwen3-tts-vc-2026-01-22qwen3-tts-vc-realtime-2026-01-15
Prerequisites
- Install SDK in a virtual environment:
More from cinience/alicloud-skills
alicloud-ai-image-qwen-image
Generate images with Model Studio DashScope SDK using Qwen Image generation models (qwen-image, qwen-image-plus, qwen-image-max, qwen-image-2.0 series and snapshots). Use when implementing or documenting image.generate requests/responses, mapping prompt/negative_prompt/size/seed/reference_image, or integrating image generation into the video-agent pipeline.
366alicloud-observability-sls-log-query
Query and troubleshoot logs in Alibaba Cloud Log Service (SLS) using query|analysis syntax and the Python SDK. Use for time-bounded log search, error investigation, and root-cause analysis workflows.
340alicloud-ai-multimodal-qwen-vl
Understand images with Alibaba Cloud Model Studio Qwen VL models (qwen3-vl-plus/qwen3-vl-flash and latest aliases). Use when building image Q&A, visual analysis, OCR-like extraction, chart/table reading, or screenshot understanding workflows.
337alicloud-ai-image-qwen-image-edit
Edit images with Alibaba Cloud Model Studio Qwen Image Edit models (qwen-image-edit, qwen-image-edit-plus, qwen-image-edit-max, qwen-image-2.0 series and snapshots). Use when modifying existing images (inpaint, replace, style transfer, local edits), preserving subject consistency, or documenting image edit request/response mappings.
335alicloud-ai-audio-tts
Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech, producing voice lines for short drama/news videos, or documenting TTS request/response fields for DashScope.
314alicloud-ai-video-wan-video
Generate videos with Model Studio DashScope SDK using Wan video generation models (wan2.6-t2v, wan2.6-i2v-flash, wan2.6-i2v and regional variants). Use when implementing or documenting video.generate requests/responses, mapping prompt/negative_prompt/duration/fps/size/seed/reference_image/motion_strength, or integrating video generation into the video-agent pipeline.
307