Error Analysis

Guide the user through reading LLM pipeline traces and building a catalog of how the system fails.

Overview

Capture the full trace: input, all intermediate LLM calls, tool uses, retrieved documents, reasoning steps, and final output.

Target: ~100 traces. This is roughly where new traces stop revealing new kinds of failures. The number depends on system complexity.

Installs

398

Repository

GitHub Stars

1.4K

First Seen

Mar 3, 2026

Security Audits