PDF to Markdown Converter

Overview

This skill converts PDF files to Markdown format using the PyMuPDF (fitz) library. It extracts text content and saves embedded images as separate PNG files. This is a fast and lightweight alternative to Docling.
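
The extraction flow described above might look roughly like the following. This is a minimal sketch, not the skill's actual script: the function names `pdf_to_markdown` and `image_name`, the output file naming scheme, and the way image links are appended after each page's text are all assumptions made for illustration.

```python
from pathlib import Path


def image_name(stem: str, page_no: int, img_no: int) -> str:
    """Build a predictable PNG file name (hypothetical naming scheme)."""
    return f"{stem}_p{page_no}_img{img_no}.png"


def pdf_to_markdown(pdf_path: str, out_dir: str) -> Path:
    """Write <out_dir>/<stem>.md plus one PNG per embedded image."""
    # PyMuPDF is imported here so image_name() stays usable without it.
    import fitz

    src = Path(pdf_path)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    parts = []

    with fitz.open(str(src)) as doc:
        for page_no, page in enumerate(doc, start=1):
            # Plain-text extraction in reading order for this page.
            parts.append(page.get_text("text").strip())

            # Each entry describes one embedded image; item 0 is its xref.
            for img_no, img in enumerate(page.get_images(full=True), start=1):
                pix = fitz.Pixmap(doc, img[0])
                if pix.n - pix.alpha >= 4:
                    # CMYK cannot be written as PNG, so convert to RGB first.
                    pix = fitz.Pixmap(fitz.csRGB, pix)
                name = image_name(src.stem, page_no, img_no)
                pix.save(str(out / name))
                parts.append(f"![{name}]({name})")

    md_path = out / f"{src.stem}.md"
    md_path.write_text("\n\n".join(parts) + "\n", encoding="utf-8")
    return md_path
```

Calling `pdf_to_markdown("report.pdf", "out")` would, under these assumptions, produce `out/report.md` and files such as `out/report_p1_img1.png`. Unusual image types (for example stencil masks without a colorspace) are not handled in this sketch.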

Special Features:

  • ✅ Automatically fixes LaTeX-style umlauts (¨a, ¨o, ¨u → ä, ö, ü)
  • ✅ Converts ß to ss (Swiss German style)
  • ✅ Corrects common LaTeX encoding issues
  • ✅ Preserves text layout
  • ✅ Extracts all images as PNG files
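
As an illustration of the first two fixes in the list above, the text clean-up could be sketched as below. The helper name `fix_latex_umlauts` is hypothetical, and handling the uppercase vowels (¨A, ¨O, ¨U) is an assumption that goes beyond the lowercase cases the list names; the skill's real implementation may differ.

```python
import re

# Base vowel -> precomposed umlaut. The uppercase entries are an assumption.
_UMLAUTS = {"a": "ä", "o": "ö", "u": "ü", "A": "Ä", "O": "Ö", "U": "Ü"}

# A standalone diaeresis (U+00A8) immediately followed by a vowel.
_DIAERESIS_VOWEL = re.compile("\u00a8([aouAOU])")


def fix_latex_umlauts(text: str) -> str:
    """Merge '¨' + vowel into one umlaut character, then map ß to ss."""
    text = _DIAERESIS_VOWEL.sub(lambda m: _UMLAUTS[m.group(1)], text)
    return text.replace("ß", "ss")


print(fix_latex_umlauts("M¨uller f¨ahrt nach Z¨urich, Gr¨oße"))
```

A diaeresis that is not directly followed by one of the listed vowels is left untouched, so unrelated text is not altered.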

Prerequisites

Ensure PyMuPDF is installed (installs in ~10-20 seconds):

pip install PyMuPDF --break-system-packages