pdf-to-markdown

Installation
Summary

Convert PDF files to Markdown with automatic detection of native text vs scanned documents.

  • Supports three conversion modes: auto-detect (default), native text extraction, and OCR for scanned PDFs
  • Fast JavaScript-based processing with no system dependencies; returns JSON output with page count and conversion mode
  • Handles basic table structure preservation and works cross-platform on Windows, macOS, and Linux
  • OCR for scanned documents requires optional tesseract.js setup; complex multi-column layouts may not preserve structure perfectly
SKILL.md

Codex compatibility note:

  • Invoke repository skills with $skill-name in Codex; this mirrored copy rewrites legacy Claude /skill-name references.
  • Prefer the plan-hard skill for planning guidance in this Codex mirror.
  • Task tracker mandate: BEFORE executing any workflow or skill step, create/update task tracking for all steps and keep it synchronized as progress changes.
  • User-question prompts mean to ask the user directly in Codex.
  • Ignore Claude-specific mode-switch instructions when they appear.
  • Strict execution contract: when a user explicitly invokes a skill, execute that skill protocol as written.
  • Subagent authorization: when a skill is user-invoked or AI-detected and its protocol requires subagents, that skill activation authorizes use of the required spawn_agent subagent(s) for that task.
  • Do not skip, reorder, or merge protocol steps unless the user explicitly approves the deviation first.
  • For workflow skills, execute each listed child-skill step explicitly and report step-by-step evidence.
  • If a required step/tool cannot run in this environment, stop and ask the user before adapting.

Codex Project-Reference Loading (No Hooks)

Codex does not receive Claude hook-based doc injection. When coding, planning, debugging, testing, or reviewing, open project docs explicitly using this routing.

Related skills
Installs
683
GitHub Stars
6
First Seen
Jan 24, 2026