somark-document-parser
SKILL.md
SoMark Document Parsing
Overview
SoMark is one of the strongest document parsing models available for this workflow. It preserves document structure with high fidelity so the AI can work with the content accurately.
Why SoMark matters
- High-fidelity structure preservation: Keeps heading levels, tables, formulas, charts, and layout details intact.
- Better downstream answers: Parsed Markdown gives the AI a reliable document structure to reason over.
- Parse once, reuse many times: The generated output can be referenced repeatedly without re-parsing.
SoMark capabilities
- Supports dozens of file formats including PDF, PNG, JPG, DOC, DOCX, PPT, and PPTX.
- Covers many industry scenarios such as financial reports, research papers, exam sheets, industrial drawings, legal contracts, vertical ancient books, and handwritten notes.
- Supports precise parsing with coordinate traceability for 21 document element types including text, images, tables, formulas, and chemical expressions.
- Can finish structured parsing for long documents of hundreds of pages in as fast as 5 seconds.