chunking-strategy

Installation
Summary

Optimal chunking strategies for RAG systems and document processing pipelines.

  • Five strategy levels from fixed-size to advanced methods (late chunking, contextual retrieval), each suited to different document types and complexity
  • Includes recursive character chunking with hierarchical separators, structure-aware chunking for Markdown/code/PDFs, and embedding-based semantic chunking with configurable thresholds
  • Provides evaluation framework covering retrieval precision, recall, end-to-end accuracy, processing latency, and resource usage
  • Best practices emphasize starting simple (512 tokens, 10-20% overlap), testing with representative documents, and iterating based on document characteristics and retrieval performance
SKILL.md

Chunking Strategy for RAG Systems

Overview

Provides chunking strategies for RAG systems, vector databases, and document processing. Recommends chunk sizes, overlap percentages, and boundary detection methods; validates semantic coherence; evaluates retrieval metrics.

When to Use

Use when building or optimizing RAG systems, vector search pipelines, document chunking workflows, or performance-tuning existing systems with poor retrieval quality.

Instructions

Choose Chunking Strategy

Select based on document type and use case:

  1. Fixed-Size Chunking (Level 1)
    • Use for simple documents without clear structure
    • Start with 512 tokens and 10-20% overlap
Related skills

More from giuseppe-trisciuoglio/developer-kit

Installs
911
GitHub Stars
246
First Seen
Feb 3, 2026