pdf-to-md

Installation
SKILL.md

PDF → Markdown

PDFs route by content type. The script does deterministic extraction; you (the agent) describe any embedded images using your built-in Vision capability.

Routing

Input Path Notes
Native-text PDF (avg >50 chars/page) pdf_to_md.py (pymupdf) Seconds, zero API cost
Scanned / image PDF ocr_extract.py (PaddleOCR) Auto-skipped from fast path; OCR vendor handles it

Workflow (agent mode — default, zero config)

Step 1 — Run the extractor

Related skills
Installs
1
First Seen
9 days ago