Gemini Vision API Skill
This skill enables Claude to use Google's Gemini API for advanced image understanding tasks including captioning, classification, visual question answering, object detection, segmentation, and multi-image analysis.
Quick Start
Prerequisites
- Get an API key: obtain one from Google AI Studio
- Install SDK:
pip install google-genai  # requires Python 3.9+
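A quick way to confirm both prerequisites before running the skill is a small check script. The following is a minimal sketch, not part of the skill itself: the helper names `python_is_supported` and `sdk_is_installed` are hypothetical, and it assumes the `google-genai` package is importable as `google.genai`.

```python
import importlib.util
import sys

MIN_PYTHON = (3, 9)  # minimum version stated in the prerequisites


def python_is_supported(version_info=sys.version_info):
    """Return True if the interpreter meets the stated minimum version."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def sdk_is_installed(module_name="google.genai"):
    """Return True if the module can be located on the import path."""
    try:
        return importlib.util.find_spec(module_name) is not None
    except (ImportError, ValueError):
        # ImportError covers a missing parent package (here, `google`).
        return False


if __name__ == "__main__":
    print("Python OK:", python_is_supported())
    print("google-genai installed:", sdk_is_installed())
```

If either check reports False, upgrade Python or re-run the install command above before continuing.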
API Key Configuration
The skill checks for GEMINI_API_KEY in this order:
- Process environment variable (recommended)
export GEMINI_API_KEY="your-api-key"