pdf-to-markdown-converter
SKILL.md
PDF to Markdown Converter
Overview
This skill converts PDF files to Markdown format using the PyMuPDF (fitz) library. It extracts text content and saves embedded images as separate PNG files. This is a fast and lightweight alternative to Docling.
Special Features:
- ✅ Automatically fixes LaTeX-style umlauts (¨a, ¨o, ¨u → ä, ö, ü)
- ✅ Converts ß to ss (Swiss German style)
- ✅ Corrects common LaTeX encoding issues
- ✅ Preserves text layout
- ✅ Extracts all images as PNG files
Prerequisites
Ensure PyMuPDF is installed (installs in ~10-20 seconds):
pip install PyMuPDF --break-system-packages