ocr-document-processor

Summary

Extract text from images and scanned PDFs with support for 100+ languages, table detection, and multiple output formats.

  • Handles PNG, JPEG, TIFF, BMP images and multi-page PDFs with per-page or full-document extraction
  • Supports 100+ languages with auto-detection, language-specific packs, and multi-language document processing
  • Exports to plain text, Markdown, JSON, HTML, and searchable PDFs with confidence scoring and bounding box data
  • Includes preprocessing (deskew, denoise, contrast enhancement, shadow removal) for low-quality scans, plus specialized parsers for receipts and business cards
  • Offers batch processing of directories, configurable page segmentation modes, and per-word confidence assessment for quality validation
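
To make the per-word confidence assessment mentioned above concrete, here is a minimal sketch of how such a quality check might look. It does not use this skill's actual API: the `Word` record, the `assess_quality` function, the 0-100 confidence scale, and the `min_conf` threshold of 60 are all assumptions chosen for illustration.

```python
import json
from dataclasses import dataclass, asdict
from statistics import mean
from typing import List, Tuple


@dataclass
class Word:
    """Hypothetical OCR word record: text, confidence (0-100), bounding box."""
    text: str
    conf: float
    bbox: Tuple[int, int, int, int]  # left, top, width, height


def assess_quality(words: List[Word], min_conf: float = 60.0) -> dict:
    """Summarize recognition quality and list words below the threshold."""
    # Ignore empty tokens, which carry no meaningful confidence.
    scored = [w for w in words if w.text.strip()]
    if not scored:
        return {"mean_conf": 0.0, "low_conf_words": [], "passed": False}
    avg = mean(w.conf for w in scored)
    low = [asdict(w) for w in scored if w.conf < min_conf]
    return {"mean_conf": round(avg, 1), "low_conf_words": low, "passed": avg >= min_conf}


# Made-up sample data standing in for real OCR output.
words = [
    Word("Invoice", 96.0, (10, 10, 80, 20)),
    Word("T0tal", 41.5, (10, 40, 50, 20)),
    Word("42.00", 88.5, (70, 40, 50, 20)),
]
report = assess_quality(words)
print(json.dumps(report, indent=2))
```

In this sketch a page passes when its mean confidence meets the threshold, while the individual low-confidence words (with their bounding boxes) are still reported so they can be reviewed by hand.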
SKILL.md

OCR Document Processor

Handle OCR-heavy inputs where text must be recovered from images or scanned pages.

Use This For

  • OCR on images and scanned PDFs
  • Searchable PDF export
  • Structured extraction to plain text, Markdown, JSON, or HTML
  • Table extraction from scanned material
  • Receipt parsing and business card parsing
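
As an illustration of the structured-extraction and table use cases above, the following sketch renders rows recovered from a scanned table as a Markdown table. The `rows_to_markdown` helper and the sample rows are hypothetical and are not part of this skill; the sketch assumes the OCR step has already produced a list of rows, each a list of cell strings, with the first row as the header.

```python
from typing import List


def rows_to_markdown(rows: List[List[str]]) -> str:
    """Render table rows (first row = header) as a Markdown table."""
    if not rows:
        return ""
    # OCR may drop trailing cells, so pad every row to the widest one.
    width = max(len(r) for r in rows)

    def fmt(cells: List[str]) -> str:
        padded = list(cells) + [""] * (width - len(cells))
        # Escape pipes so cell text cannot break the table layout.
        return "| " + " | ".join(c.replace("|", "\\|").strip() for c in padded) + " |"

    header, *body = rows
    lines = [fmt(header), "| " + " | ".join(["---"] * width) + " |"]
    lines += [fmt(r) for r in body]
    return "\n".join(lines)


# Made-up sample rows; the last row is missing a cell on purpose.
rows = [["Item", "Qty", "Price"], ["Coffee", "2", "7.00"], ["Bagel", "1"]]
md = rows_to_markdown(rows)
print(md)
```

Padding short rows keeps the column count consistent, which matters because cells are easily lost when recognizing faint or skewed scans.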

Workflow

Installs: 3.9K
GitHub Stars: 53
First Seen: Jan 24, 2026