Qianfan OCR Document Intelligence

This skill orchestrates visual understanding for images and PDFs. It does not implement a vision model itself. It selects the right analysis mode, prepares inputs, invokes the bundled CLI, and returns a structured result for the upstream agent.

Required Execution Order

Always follow this order:

Check whether QIANFAN_TOKEN is already available.
If the token is missing, stop immediately and ask the user for the API Key.
If the user provides the API Key, write it to <skill-root>/.env as QIANFAN_TOKEN=....
Only after the token is available, continue to mode selection, reference loading, and CLI calls.

This token preflight takes precedence over all later rules in this skill. Do not read references/*.md, do not select a mode, and do not call any bundled script until the token check has passed.

qianfanocr-document-intelligence

Qianfan OCR Document Intelligence

Required Execution Order

More from baidubce/skills

famou-experiment-manager

baidu-search

template-skill

famou-data-analysis

medical-bill-organizer

famou-artifact-generator