Vision Recognition + OCR
Cross-platform Python: on Windows prefer `py -3.11`; on Linux/macOS prefer `python3`; if plain `python` already points to Python 3, it also works.
Recognize vehicles, animals, and plants, or extract text from screenshots, photos, invoices, and tables via Baidu vision APIs. This skill combines lightweight classification and OCR workflows in one place.
Why install this
Use this skill when you want to:
- identify a car, animal, or plant from an image
- extract text from screenshots, invoices, handwriting, or tables
- pass a local path, a public URL, or a base64-encoded image to the same tool family
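As a rough illustration of the three input forms, the sketch below maps one string to a single request field. It is not this skill's actual code: the helper name `build_image_payload` is hypothetical, and the field names `url` and `image` are assumptions that should be checked against the Baidu API documentation for the endpoint in use.

```python
import base64
import os


def build_image_payload(source: str) -> dict:
    """Map a local path, public URL, or base64 string to one request field.

    Hypothetical helper: the "url" and "image" field names are assumptions.
    """
    if not source:
        raise ValueError("empty image source")

    # Public URL: pass it through unchanged.
    if source.startswith(("http://", "https://")):
        return {"url": source}

    # Local file: read the bytes and base64-encode them.
    if os.path.isfile(source):
        with open(source, "rb") as fh:
            return {"image": base64.b64encode(fh.read()).decode("ascii")}

    # Otherwise accept the string only if it is strictly valid base64.
    # This is a heuristic: a short missing filename such as "abcd" would pass.
    try:
        base64.b64decode(source, validate=True)
    except ValueError:
        raise ValueError(f"not a file, URL, or base64 string: {source!r}") from None
    return {"image": source}
```

Checking for a URL first and an existing file second keeps the base64 test as a last resort, so a mistyped path usually fails loudly instead of being sent as image data.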
Common use cases
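- identify a car model, or identify an animal or plant from a picture
- extract text from screenshots, receipts, and tables
- switch between category recognition and OCR extraction on the same image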