Gemini Video Understanding Skill

This skill analyzes video with Google's Gemini API. It covers summarization, question answering, transcription, timestamp-based queries, clipping, and multi-video comparison.

Capabilities

Video Summarization: Create concise summaries of video content
Question Answering: Answer specific questions about video content
Transcription: Transcribe audio with visual descriptions and timestamps
Timestamp References: Query specific moments in videos (MM:SS format)
Video Clipping: Process specific segments using start/end offsets
Multiple Videos: Compare and analyze up to 10 videos (Gemini 2.5+)
YouTube Support: Analyze YouTube videos directly (preview feature)
Custom Frame Rate: Adjust FPS sampling for different video types
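
As a rough illustration of how timestamp references, clipping offsets, and custom frame rates might fit together in one request, here is a minimal sketch that builds a request part as a plain dictionary. The helper names (mmss_to_seconds, build_video_part) are hypothetical and not part of this skill. The field names (file_data, file_uri, video_metadata, start_offset, end_offset, fps) are assumed from the public Gemini REST documentation, so check them against the current API reference before relying on them.

```python
from typing import Optional


def mmss_to_seconds(stamp: str) -> int:
    """Convert an MM:SS timestamp such as '01:15' into whole seconds."""
    minutes, seconds = stamp.split(":")
    return int(minutes) * 60 + int(seconds)


def build_video_part(
    file_uri: str,
    start: Optional[str] = None,
    end: Optional[str] = None,
    fps: Optional[float] = None,
) -> dict:
    """Build one video part for a generateContent-style request body.

    Assumption: offsets are sent as duration strings like '40s' and the
    metadata key is 'video_metadata'. Verify against the API reference.
    """
    part = {"file_data": {"file_uri": file_uri}}
    metadata = {}
    if start is not None:
        metadata["start_offset"] = f"{mmss_to_seconds(start)}s"
    if end is not None:
        metadata["end_offset"] = f"{mmss_to_seconds(end)}s"
    if fps is not None:
        metadata["fps"] = fps
    if metadata:
        # Only attach metadata when clipping or FPS sampling was requested.
        part["video_metadata"] = metadata
    return part
```

A call such as build_video_part("files/example", start="00:40", end="01:20", fps=5) would describe a 40-second clip sampled at 5 frames per second. The file URI here is a placeholder.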

Supported Formats

MP4, MPEG, MOV, AVI, FLV, MPG, WebM, WMV, 3GPP
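
Before uploading, it can help to check that a file's extension maps to one of the formats above. The sketch below is one way to do that. The helper name video_mime_type is hypothetical, and the MIME strings are assumed from the Gemini documentation's list of video types, so confirm them against the current docs.

```python
from pathlib import Path

# Assumed extension-to-MIME mapping for the formats listed above.
VIDEO_MIME_TYPES = {
    ".mp4": "video/mp4",
    ".mpeg": "video/mpeg",
    ".mov": "video/mov",
    ".avi": "video/avi",
    ".flv": "video/x-flv",
    ".mpg": "video/mpg",
    ".webm": "video/webm",
    ".wmv": "video/wmv",
    ".3gp": "video/3gpp",
    ".3gpp": "video/3gpp",
}


def video_mime_type(filename: str) -> str:
    """Return the assumed MIME type for a supported video file name.

    Raises ValueError when the extension is not in the supported list.
    """
    suffix = Path(filename).suffix.lower()
    try:
        return VIDEO_MIME_TYPES[suffix]
    except KeyError:
        raise ValueError(f"Unsupported video format: {suffix or filename!r}")
```

The lookup is case-insensitive on the extension, so "clip.MOV" and "clip.mov" resolve to the same type.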
