pdf

Installation
Summary

Comprehensive PDF processing covering text extraction, merging, splitting, OCR, form filling, and encryption.

  • Supports multiple Python libraries: pypdf for basic operations (merge, split, rotate, encrypt), pdfplumber for text and table extraction with layout preservation, and reportlab for creating new PDFs from scratch
  • Includes command-line tools (pdftotext, qpdf, pdftk) for batch processing and scripting workflows
  • Handles scanned PDFs via OCR using pytesseract and pdf2image to make them searchable and extract text
  • Covers watermarking, image extraction, metadata access, password protection, and form filling with dedicated reference documentation
SKILL.md

PDF Processing Guide

Overview

This guide covers essential PDF processing operations using Python libraries and command-line tools. For advanced features, JavaScript libraries, and detailed examples, see REFERENCE.md. If you need to fill out a PDF form, read FORMS.md and follow its instructions.

Quick Start

from pypdf import PdfReader, PdfWriter

# Read a PDF
reader = PdfReader("document.pdf")
print(f"Pages: {len(reader.pages)}")

# Extract text
text = ""
for page in reader.pages:
    text += page.extract_text()
Related skills

More from davila7/claude-code-templates

Installs
596
GitHub Stars
27.2K
First Seen
Jan 21, 2026