# Failure Taxonomy Builder
Transform raw, freeform trace annotations from open coding sessions into a structured taxonomy of binary failure modes, following the grounded theory methodology from the Analyze-Measure-Improve evaluation lifecycle.
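As a rough sketch of the target output, each entry in the taxonomy is a binary failure mode: for any given trace, the failure either occurred or it did not. The field names below are illustrative, not a fixed schema, and the two example modes are hypothetical.

```python
# A minimal sketch of a structured taxonomy of binary failure modes.
# Field names ("name", "definition", "example_trace_ids") are assumptions
# for illustration, not a schema this skill mandates.
taxonomy = [
    {
        "name": "retrieval_wrong_document",
        "definition": "The retriever returned a document that does not "
                      "contain the information needed to answer the query.",
        "example_trace_ids": ["t-001", "t-017"],
    },
    {
        "name": "ignored_query_constraint",
        "definition": "The response omits or violates an explicit "
                      "constraint stated in the user query.",
        "example_trace_ids": ["t-002"],
    },
]

# Every mode carries a name, a crisp yes/no definition, and grounding examples.
for mode in taxonomy:
    assert set(mode) == {"name", "definition", "example_trace_ids"}
```

The key property is that each definition can be answered yes or no for a single trace, which is what makes the taxonomy measurable downstream.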
## When This Skill Applies
The user has already completed open coding — they've read through LLM pipeline traces and written short, freeform notes describing what went wrong (the "point of first failure"). Now they need to move from that chaotic pile of observations into an organized, actionable taxonomy. This is the axial coding step.
Typical inputs look like a JSON array, CSV, or spreadsheet of objects with fields like:

- `trace_id` — identifier for the trace
- `annotation` or `note` — the freeform open-coded observation
- Optionally: `pass_fail`, `trace_summary`, `query`, or the full trace itself
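A concrete input might look like the following. The records and field values are hypothetical, matching the shape described above; the filtering step simply keeps the annotations that will feed axial coding.

```python
import json

# Hypothetical export from an open coding session. The trace IDs and
# annotation text are invented for illustration.
raw = json.loads("""
[
  {"trace_id": "t-001", "annotation": "retrieved wrong document; answer hallucinated", "pass_fail": "fail"},
  {"trace_id": "t-002", "annotation": "ignored the user's date filter", "pass_fail": "fail"},
  {"trace_id": "t-003", "annotation": "", "pass_fail": "pass"}
]
""")

# Keep only failed traces that carry a non-empty note: these are the
# observations the axial coding step will cluster into failure modes.
to_code = [r for r in raw if r.get("pass_fail") == "fail" and r.get("annotation")]
print(len(to_code))  # 2
```

Passing traces and empty annotations drop out here, since only observed failures contribute to the taxonomy.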
## Core Workflow