gemini-image

Installation
Summary

Analyze images using Gemini's vision capabilities for OCR, UI analysis, and visual understanding.

  • Supports PNG, JPEG, GIF, and WebP images including screenshots, diagrams, charts, and code snippets
  • Built-in analysis templates for common tasks: text extraction, code recovery, UI/UX feedback, error diagnosis, and data extraction from charts
  • Handles single and multiple image comparisons in a single request
  • Requires Google Generative AI library and valid GEMINI_API_KEY environment variable
SKILL.md

Gemini Image Analysis

Analyze images using Gemini Pro's vision capabilities.

Prerequisites

pip install google-generativeai
export GEMINI_API_KEY=your_api_key

CLI Reference

Basic Image Analysis

# Analyze an image
gemini -m pro -f /path/to/image.png "Describe this image in detail"
Related skills
Installs
993
GitHub Stars
23
First Seen
Jan 21, 2026