audio-transcribe

Installation
SKILL.md

Audio Transcribe

Transcribes audio files to text with timestamps. Supports automatic language detection, speaker identification (diarization), and outputs structured JSON with segment-level timing.

Command

npx agent-media@latest audio transcribe --in <path> [options]

Inputs

Option Required Description
--in Yes Input audio file path or URL (supports mp3, wav, m4a, ogg)
--diarize No Enable speaker identification
--language No Language code (auto-detected if not provided)
--speakers No Number of speakers hint for diarization
--out No Output path, filename or directory (default: ./)
Related skills
Installs
136
GitHub Stars
4
First Seen
Jan 20, 2026