transcribe
Transcribe
Speech-to-text with two backends: MLX Whisper (local, Apple Silicon) and Groq API (cloud).
Default mode auto uses local first, cloud as fallback.
Usage
bash scripts/transcribe.sh --input "/path/to/audio.m4a"
Options
| Flag | Description | Default |
|---|---|---|
--input FILE |
Audio/video file path (required) | — |
--backend MODE |
auto, mlx, or groq |
auto |
--model MODEL |
Whisper model name | Per backend |
--language LANG |
Language code (zh, en, ja, etc.) |
zh |
More from kcchien/skills
excalidraw
Create professional diagrams and visualizations using Excalidraw JSON format. Specialized for IT architecture diagrams, flowcharts, network topology, system design, microservices, ER diagrams, and state machines. Includes curated color palettes and visual styles. Use when working with .excalidraw files, or when user mentions diagrams, flowcharts, architecture visualization, or drawing. Delegates to subagents to prevent context exhaustion from verbose JSON.
5industrial-expert
>
4vscode-extension-uiux
|
2crisp-reading
>
2agent-browser
Browser automation CLI for AI agents. Use when the user needs to interact with websites, including navigating pages, filling forms, clicking buttons, taking screenshots, extracting data, testing web apps, or automating any browser task. Triggers include requests to "open a website", "fill out a form", "click a button", "take a screenshot", "scrape data from a page", "test this web app", "login to a site", "automate browser actions", or any task requiring programmatic web interaction.
2adapt
Adapt designs to work across different screen sizes, devices, contexts, or platforms. Ensures consistent experience across varied environments.
1