vision-multimodal

Installation
SKILL.md

Vision & Multimodal Skill

Leverage Claude's vision capabilities for image analysis, document processing, and multimodal understanding.

When to Use This Skill

  • Image analysis and description
  • Document/PDF processing
  • Screenshot analysis
  • OCR-like text extraction
  • Visual comparison
  • Chart and diagram interpretation

Supported Formats

Format Status Best For
JPEG Photos, natural scenes
PNG Screenshots, UI, text
Related skills
Installs
276
GitHub Stars
11
First Seen
Jan 24, 2026