addon-docling-legal-chunk-embed


Add-on: Docling Legal Chunk + Embed

Use this skill when a project needs to ingest legal PDFs and convert them into markdown and clause-sized chunks suitable for retrieval and downstream clause reasoning.

Compatibility

  • Works with architect-python-uv-batch.
  • Works with architect-python-uv-fastapi-sqlalchemy (worker or async job path).
  • Commonly paired with addon-rag-ingestion-pipeline.

Inputs

Collect:

  • LEGAL_SOURCE_DIR: default data/inbox/legal.
  • CLAUSE_MAX_CHARS: default 1400.
  • CLAUSE_OVERLAP_CHARS: default 120.
  • EMBED_PROVIDER: sentence-transformers | openai.
  • OUTPUT_MODE: markdown+json (default) | json-only.
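
The inputs above could be collected and applied roughly as follows. This is a minimal sketch, not the skill's actual implementation: the `LegalIngestConfig` class and the `load_config` and `chunk_text` helpers are hypothetical names, reading the inputs from environment-style variables is an assumption, and the fixed-width sliding window is only one plausible reading of how `CLAUSE_MAX_CHARS` and `CLAUSE_OVERLAP_CHARS` interact. Because no default is listed for `EMBED_PROVIDER`, the sketch requires it explicitly.

```python
import os
from dataclasses import dataclass
from typing import List, Mapping, Optional

VALID_PROVIDERS = ("sentence-transformers", "openai")
VALID_OUTPUT_MODES = ("markdown+json", "json-only")


@dataclass(frozen=True)
class LegalIngestConfig:
    """Hypothetical container for the inputs listed above."""
    embed_provider: str                      # no default given in the skill
    legal_source_dir: str = "data/inbox/legal"
    clause_max_chars: int = 1400
    clause_overlap_chars: int = 120
    output_mode: str = "markdown+json"

    def __post_init__(self) -> None:
        if self.embed_provider not in VALID_PROVIDERS:
            raise ValueError(f"EMBED_PROVIDER must be one of {VALID_PROVIDERS}")
        if self.output_mode not in VALID_OUTPUT_MODES:
            raise ValueError(f"OUTPUT_MODE must be one of {VALID_OUTPUT_MODES}")
        if self.clause_max_chars <= 0:
            raise ValueError("CLAUSE_MAX_CHARS must be positive")
        if not 0 <= self.clause_overlap_chars < self.clause_max_chars:
            raise ValueError("CLAUSE_OVERLAP_CHARS must be in [0, CLAUSE_MAX_CHARS)")


def load_config(env: Optional[Mapping[str, str]] = None) -> LegalIngestConfig:
    """Build a config from a mapping (assumed here to be os.environ)."""
    env = os.environ if env is None else env
    if "EMBED_PROVIDER" not in env:
        raise ValueError("EMBED_PROVIDER is required")
    return LegalIngestConfig(
        embed_provider=env["EMBED_PROVIDER"],
        legal_source_dir=env.get("LEGAL_SOURCE_DIR", "data/inbox/legal"),
        clause_max_chars=int(env.get("CLAUSE_MAX_CHARS", "1400")),
        clause_overlap_chars=int(env.get("CLAUSE_OVERLAP_CHARS", "120")),
        output_mode=env.get("OUTPUT_MODE", "markdown+json"),
    )


def chunk_text(text: str, max_chars: int = 1400, overlap_chars: int = 120) -> List[str]:
    """Split text into windows of at most max_chars characters, where each
    window starts overlap_chars before the end of the previous one."""
    if max_chars <= 0 or not 0 <= overlap_chars < max_chars:
        raise ValueError("need max_chars > 0 and 0 <= overlap_chars < max_chars")
    step = max_chars - overlap_chars
    chunks: List[str] = []
    start = 0
    while start < len(text):
        chunks.append(text[start:start + max_chars])
        if start + max_chars >= len(text):
            break  # the window just emitted already reached the end
        start += step
    return chunks
```

A character window like this can cut through the middle of a clause. A real legal pipeline would more likely split on clause or section boundaries first and use the character limits only as an upper bound, so treat the windowing here as a fallback illustration.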
First seen: Feb 27, 2026