long-document-llm-pipeline
Installation
SKILL.md
Long Document LLM Processing Pipeline
Extracted: 2026-02-08 (updated 2026-02-09) Context: When processing documents over ~50K characters through LLM APIs for extraction, generation, or analysis tasks.
Problem
Sending large documents (>50K chars) as a single LLM prompt causes:
- Lost in the Middle - LLMs lose attention on content in the middle of long inputs (30%+ accuracy drop, per Liu et al. 2023)
- High cost - Entire document becomes input tokens even if only portions are relevant
- No partial retry - If generation fails, must re-process the entire document
- No parallelism - Single sequential API call