gemini-image
Analyze images using Gemini's vision capabilities for OCR, UI analysis, and visual understanding.
- Supports PNG, JPEG, GIF, and WebP images including screenshots, diagrams, charts, and code snippets
- Built-in analysis templates for common tasks: text extraction, code recovery, UI/UX feedback, error diagnosis, and data extraction from charts
- Handles single and multiple image comparisons in a single request
- Requires Google Generative AI library and valid GEMINI_API_KEY environment variable
Gemini Image Analysis
Analyze images using Gemini Pro's vision capabilities.
Prerequisites
pip install google-generativeai
export GEMINI_API_KEY=your_api_key
CLI Reference
Basic Image Analysis
# Analyze an image
gemini -m pro -f /path/to/image.png "Describe this image in detail"
More from johnlindquist/claude
memory
Persistent knowledge storage using basic-memory CLI. Use to save notes, search memories semantically, and build context for topics across sessions.
264brainstorm
Generate ideas and explore possibilities with AI. Use for creative problem solving, generating alternatives, and expanding on concepts.
217deepwiki
Query DeepWiki for repository documentation and structure. Use to understand open source projects, find API docs, and explore codebases.
210raycast-extension
Build Raycast extensions with React and TypeScript. Use when the user asks to create a Raycast extension, command, or tool.
201think
Deep multi-framework reasoning using Gemini. Use for complex problem analysis, challenging ideas, and evaluating multiple options with structured thinking.
178spider
Web crawling and scraping with analysis. Use for crawling websites, security scanning, and extracting information from web pages.
149