qianfanocr-document-intelligence

Installation
SKILL.md

Qianfan OCR Document Intelligence

This skill orchestrates visual understanding for images and PDFs. It does not implement a vision model itself. It selects the right analysis mode, prepares inputs, invokes the bundled CLI, and returns a structured result for the upstream agent.

Required Execution Order

Always follow this order:

  1. Check whether QIANFAN_TOKEN is already available.
  2. If the token is missing, stop immediately and ask the user for the API Key.
  3. If the user provides the API Key, write it to <skill-root>/.env as QIANFAN_TOKEN=....
  4. Only after the token is available, continue to mode selection, reference loading, and CLI calls.

This token preflight takes precedence over all later rules in this skill. Do not read references/*.md, do not select a mode, and do not call any bundled script until the token check has passed.

Related skills

More from baidubce/skills

Installs
26
Repository
baidubce/skills
GitHub Stars
20
First Seen
Mar 23, 2026