transcribe


Audio Transcription Pipeline

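A VAD-first, evidence-backed Whisper transcription pipeline that runs locally on Apple Silicon. Whisper only receives the segments that voice activity detection marks as speech, so the pipeline aims for zero hallucinations on silence or noise. It also applies correct domain terminology and finishes with an LLM polish pass.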

Architecture: Audio → ffmpeg (16kHz WAV) → Silero-VAD (speech segmentation) → MLX Whisper → Dictionary replacement → Claude LLM correction → Output.
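Two of the stages above are simple enough to sketch without any model dependencies: the ffmpeg conversion and the dictionary replacement. The sketch below is illustrative only and is not taken from the skill itself. The function names `ffmpeg_args` and `apply_dictionary` are hypothetical, mono output and 16-bit PCM are assumptions (the source only states 16kHz WAV), and whole-word, case-insensitive matching is one plausible way to do dictionary replacement.

```python
import re


def ffmpeg_args(src: str, dst: str) -> list[str]:
    """Build an ffmpeg command line that converts src to a 16 kHz WAV at dst."""
    return [
        "ffmpeg", "-y",        # overwrite dst if it already exists
        "-i", src,
        "-ar", "16000",        # 16 kHz sample rate, as in the architecture line
        "-ac", "1",            # assumption: downmix to mono
        "-c:a", "pcm_s16le",   # assumption: 16-bit signed PCM
        dst,
    ]


def apply_dictionary(text: str, terms: dict[str, str]) -> str:
    """Replace whole-word, case-insensitive matches of each key with its value."""
    if not terms:
        return text
    # Longest keys first, so multi-word entries win over shorter overlapping ones.
    keys = sorted(terms, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b",
        re.IGNORECASE,
    )
    lookup = {k.lower(): v for k, v in terms.items()}
    # A function replacement avoids backslash-escape surprises in the values.
    return pattern.sub(lambda m: lookup[m.group(0).lower()], text)
```

The argument list can be passed to `subprocess.run(ffmpeg_args("talk.m4a", "talk.wav"), check=True)`. Running dictionary replacement before the LLM step means the correction pass already sees the intended terminology.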

Prerequisites

  • macOS with Apple Silicon (M1/M2/M3/M4) — required for MLX
  • Python 3.9+ — Xcode Python at /Library/Developer/CommandLineTools/usr/bin/python3 or Homebrew Python
  • ffmpeg — for audio preprocessing (brew install ffmpeg)
  • Anthropic API key — for LLM correction step (optional but recommended)
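The prerequisites above can be checked from Python before installing anything. This is a minimal sketch, not the skill's own detection logic. The function name `check_prerequisites` is hypothetical, and reading the key from the `ANTHROPIC_API_KEY` environment variable is an assumption about how the key is supplied.

```python
import os
import platform
import shutil
import sys


def check_prerequisites() -> dict[str, bool]:
    """Report which of the listed prerequisites are satisfied on this machine."""
    return {
        "macos": sys.platform == "darwin",
        # Reports False when Python itself runs under Rosetta (x86_64).
        "apple_silicon": platform.machine() == "arm64",
        "python_3_9_plus": sys.version_info >= (3, 9),
        "ffmpeg": shutil.which("ffmpeg") is not None,
        # Assumption: the key is provided via this environment variable.
        # It is optional, so a missing key only disables LLM correction.
        "anthropic_api_key": bool(os.environ.get("ANTHROPIC_API_KEY")),
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{'ok' if ok else 'missing':8} {name}")
```

Run it with the same interpreter you plan to use for the pipeline, since the Xcode and Homebrew Pythons can differ in version and architecture.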

Setup

Detection Order

Check if the pipeline is already installed:

More from simonstrumse/vibelabs-skills

Installs: 4
First seen: Feb 28, 2026