mineru-document-explorer
SKILL.md
MinerU Document Explorer
PDF reading toolkit via doc-search CLI. Search first, then read relevant pages — never scan an entire PDF.
⚠️ Network capabilities: This skill can optionally call external APIs (PageIndex outline generation, MinerU cloud OCR, embedding/reranker services) and run a local FastAPI server. All network features are opt-in and disabled by default.
Path conventions
SKILL_DIR = <this file's parent directory>
SCRIPTS = SKILL_DIR/scripts
Setup check
Read SKILL_DIR/config-state.json. If missing or setup_complete is not true:
- Read
references/setup.mdand run the installer - After setup, ask the user if they want to configure PageIndex (e.g. "If you have an OpenAI-compatible API key, you can enable PageIndex to auto-generate a document outline — useful for scanned docs or manuals. Want to set it up?")
- If the user provides
pageindex_api_key/pageindex_base_url→ write toSCRIPTS/doc-search/config.yaml; if skipped → continue immediately, do not block