pdf-process-mineru
SKILL.md
Tool List
1. pdf_to_markdown
Convert PDF documents to Markdown format, preserving document structure, formulas, tables, and images.
Description: Use MinerU to parse PDF documents and output in Markdown format, supporting OCR, formula recognition, table extraction, and other features.
Parameters:
file_path(string, required): Absolute path to the PDF fileoutput_dir(string, required): Absolute path to the output directorybackend(string, optional): Parsing backend, options:hybrid-auto-engine(default),pipeline,vlm-auto-enginelanguage(string, optional): OCR language code, such asen(English),ch(Chinese),ja(Japanese), etc., defaults to auto-detectionenable_formula(boolean, optional): Whether to enable formula recognition, defaults to trueenable_table(boolean, optional): Whether to enable table extraction, defaults to truestart_page(integer, optional): Start page number (starting from 0), defaults to 0end_page(integer, optional): End page number (starting from 0), defaults to -1 meaning parse all pages
Return Value: