vision

Multi-provider vision tool. It calls one of several vision models to describe an image: give it an image path and a prompt, and it returns a text description.

Quick start

python vision.py [--provider <name>] <image_path> <prompt>

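The provider is resolved in this order: the --provider flag, then the VISION_PROVIDER environment variable, then the first provider whose API key is set. If --provider is omitted, resolution falls through to the environment.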
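The resolution order can be sketched roughly as follows. This is an illustrative sketch, not the code in vision.py: the function name resolve_provider and the API_KEY_VARS table are hypothetical, and the table lists only doubao because that is the only provider shown here. The order in which API keys are checked is also an assumption.

```python
import os
from typing import Mapping, Optional

# Hypothetical lookup table: provider name -> environment variable holding its API key.
# Only doubao is documented here; other providers would be added the same way.
API_KEY_VARS = {
    "doubao": "DOUBAO_API_KEY",
}


def resolve_provider(flag: Optional[str], env: Mapping[str, str] = os.environ) -> str:
    """Pick a provider: --provider flag, then VISION_PROVIDER, then first API key found."""
    # 1. An explicit --provider flag wins.
    if flag:
        return flag
    # 2. Otherwise use the VISION_PROVIDER environment variable, if set.
    from_env = env.get("VISION_PROVIDER")
    if from_env:
        return from_env
    # 3. Otherwise take the first provider whose API key is present (table order assumed).
    for name, key_var in API_KEY_VARS.items():
        if env.get(key_var):
            return name
    raise RuntimeError("no provider given and no API key found in the environment")
```

Passing the environment as a parameter keeps the function easy to test without touching the real process environment.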

Providers

doubao (豆包 / Volcengine Ark)

  • API key: DOUBAO_API_KEY
  • Default model: doubao-seed-2-0-pro-260215
  • Custom endpoint: DOUBAO_BASE_URL
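As a rough illustration of how these settings might be gathered, the sketch below reads the two environment variables listed above. DoubaoConfig and load_doubao_config are hypothetical names, not part of vision.py, and the built-in default endpoint is not stated here, so the sketch leaves base_url as None when DOUBAO_BASE_URL is unset.

```python
from dataclasses import dataclass
from typing import Mapping, Optional

# Default model name as listed for the doubao provider.
DEFAULT_DOUBAO_MODEL = "doubao-seed-2-0-pro-260215"


@dataclass(frozen=True)
class DoubaoConfig:  # hypothetical container, for illustration only
    api_key: str
    model: str
    base_url: Optional[str]  # None means "use the tool's built-in endpoint"


def load_doubao_config(env: Mapping[str, str]) -> DoubaoConfig:
    """Build the doubao settings from environment variables."""
    api_key = env.get("DOUBAO_API_KEY")
    if not api_key:
        raise RuntimeError("DOUBAO_API_KEY is not set")
    # An empty DOUBAO_BASE_URL is treated the same as an unset one.
    base_url = env.get("DOUBAO_BASE_URL") or None
    return DoubaoConfig(api_key=api_key, model=DEFAULT_DOUBAO_MODEL, base_url=base_url)
```

Calling load_doubao_config(os.environ) after importing os would read the real environment.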
First Seen
May 23, 2026
vision — xiincs/claude-code-vision-skill