Vision Recognition + OCR

Cross-platform Python: on Windows prefer py -3.11; on Linux/macOS prefer python3; if plain python already points to Python 3, it also works.

Recognize vehicles, animals, and plants, or extract text from screenshots, photos, invoices, and tables via Baidu vision APIs. This skill combines lightweight classification and OCR workflows in one place.

Why install this

Use this skill when you want to:

identify a car, animal, or plant from an image
extract text from screenshots, invoices, handwriting, or tables
send either a local path, public URL, or base64 image into the same tool family

Common use cases

识别车型 / 看图识别动物或植物
提取截图、票据、表格中的文字
对同一张图在“识别类别”和“OCR 提取”之间切换

vision-recognition-ocr

Vision Recognition + OCR

Why install this

Common use cases