transcribe
Installation
SKILL.md
VoxFlow Transcribe / Dub / Translate Skill
Audio/video → text → other languages → re-voiced video. Five tightly-related commands:
| Command | What it does | Output |
|---|---|---|
asr (alias transcribe) |
Audio/video → text. Cloud (Tencent), local (Whisper), or Azure Batch for 30-min+ files. | SRT / TXT / JSON |
asr-jobs |
Browse, inspect, cancel, or download long-running Azure jobs | list / show / cancel / download |
translate |
Translate SRT / text / file | SRT / TXT |
dub |
SRT → timeline-aligned TTS, optionally merged into video | WAV / MP4 |
video-translate |
End-to-end: ASR → translate → dub → merge MP4 | MP4 in target language |
summarize |
Audio/video/text → summary slides (PPTX, optional video) | PPTX / MP4 |
Prerequisites
npm install -g voxflowandvoxflow loginffmpeginstalled (brew install ffmpeg/sudo apt install ffmpeg) — required fordub --video,video-translate, audio extraction- Optional:
whisper.cppfor local engine (no quota cost). Install viabrew install whisper-cppor compile from source. - Optional:
sox/recfor--micrecording.