audio-understanding

Installation
SKILL.md

Audio Understanding: Audio transcription and analysis with Gemini

File support

This skill supports audio analysis using Google Gemini models. Supported formats:

Category Extensions
Audio .mp3, .wav, .m4a, .ogg, .flac
  • Local audio files up to 9.5 hours long
  • YouTube links (youtube.com/watch, youtu.be, youtube.com/embed)

Reference: https://ai.google.dev/gemini-api/docs/audio?example=dialogue

How to use

bash ${CLAUDE_PLUGIN_ROOT}/scripts/gemini.sh --file=AUDIO_PATH "YOUR QUESTION ABOUT THE AUDIO"
Related skills
Installs
5
GitHub Stars
1
First Seen
Feb 9, 2026