vision
Installation
SKILL.md
vision
Multi-provider vision tool. Call various vision models to describe images. Feed it a prompt + image path, get back a text description.
Quick start
python vision.py [--provider <name>] <image_path> <prompt>
When --provider is omitted, the provider is resolved by: --provider flag > VISION_PROVIDER env > first API key found.
Providers
doubao (豆包 / Volcengine Ark)
- API key:
DOUBAO_API_KEY - Default model:
doubao-seed-2-0-pro-260215 - Custom endpoint:
DOUBAO_BASE_URL