pdf-extractor
Installation
SKILL.md
PDF Data Extraction
Extract text and structured data from PDF documents using a multi-backend approach with automatic fallback.
Overview
This skill provides PDF text extraction with 9 different backends, automatic GPU detection, and intelligent backend selection. The extraction system tries backends in order until one succeeds, producing markdown output optimized for further processing.
Quick Start Workflow
To extract text from PDFs:
- Single file extraction (installed CLI - recommended):
Output: Createsextract-pdfs /path/to/document.pdfdocument.mdin the same directory.