rag-auditor

Installation
SKILL.md

RAG Auditor

Systematic RAG pipeline evaluation across the full retrieval-generation chain: designs evaluation query sets, measures retrieval metrics (Precision@K, Recall@K, MRR), evaluates generation quality (groundedness, completeness, hallucination rate), diagnoses component-level failures, and recommends targeted improvements.

Reference Files

File Contents Load When
references/retrieval-metrics.md Precision@K, Recall@K, MRR, NDCG definitions and calculation Always
references/generation-metrics.md Groundedness, completeness, hallucination detection methods Generation evaluation needed
references/failure-taxonomy.md RAG failure categories: retrieval, generation, chunking, embedding Failure diagnosis needed
references/diagnostic-queries.md Designing evaluation query sets, known-answer questions, difficulty levels Evaluation setup

Prerequisites

  • Access to the RAG pipeline (or its outputs for post-hoc evaluation)
Related skills

More from mathews-tom/armory

Installs
43
GitHub Stars
230
First Seen
Mar 23, 2026