doc-parse
SKILL.md
Doc Parse
Parse Word (.doc/.docx) documents into structured Markdown using MinerU. Preserves document hierarchy including headings, lists, tables, and paragraphs.
Install
npm install -g mineru-open-api
# or via Go (macOS/Linux):
go install github.com/opendatalab/MinerU-Ecosystem/cli/mineru-open-api@latest
Quick Start
# Quick parse from .docx (no token required)
mineru-open-api flash-extract report.docx
# Save structured Markdown to directory