transcribe

VoxFlow Transcribe / Dub / Translate Skill

Audio/video → text → other languages → re-voiced video. Six closely related commands:

| Command | What it does | Output |
| --- | --- | --- |
| asr (alias transcribe) | Audio/video → text. Cloud (Tencent), local (Whisper), or Azure Batch for 30-min+ files. | SRT / TXT / JSON |
| asr-jobs | Browse, inspect, cancel, or download long-running Azure jobs | list / show / cancel / download |
| translate | Translate SRT / text / file | SRT / TXT |
| dub | SRT → timeline-aligned TTS, optionally merged into video | WAV / MP4 |
| video-translate | End-to-end: ASR → translate → dub → merge MP4 | MP4 in target language |
| summarize | Audio/video/text → summary slides (PPTX, optional video) | PPTX / MP4 |

Prerequisites

  • Run npm install -g voxflow, then voxflow login
  • ffmpeg installed (brew install ffmpeg / sudo apt install ffmpeg) — required for dub --video, video-translate, audio extraction
  • Optional: whisper.cpp for local engine (no quota cost). Install via brew install whisper-cpp or compile from source.
  • Optional: sox / rec for --mic recording.
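
Before running any command, it can help to confirm which of the tools above are actually on PATH. The following is a minimal sketch, not part of VoxFlow: `have` and `report` are hypothetical helper names, and the script relies only on the standard `command -v` lookup. The whisper.cpp check is left out because the installed binary name depends on how it was built or packaged.

```shell
#!/bin/sh
# Report whether each prerequisite listed above is on PATH.
# have() and report() are illustrative helpers, not VoxFlow commands.

have() {
    # Succeeds if the named tool resolves on PATH.
    command -v "$1" >/dev/null 2>&1
}

report() {
    # $1 = tool name, $2 = "required" or "optional"
    if have "$1"; then
        echo "ok       $1"
    else
        echo "missing  $1 ($2)"
    fi
}

report voxflow required   # installed via: npm install -g voxflow
report ffmpeg  required   # needed for dub --video, video-translate, audio extraction
report sox     optional   # only used for --mic recording
report rec     optional   # only used for --mic recording
```

The script only prints a status line per tool and never exits with an error, so it is safe to run before anything is installed.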
transcribe — voxflowstudio/skills