deepeval

Installation
SKILL.md

DeepEval

Use this skill to add an end-to-end eval loop to AI applications: instrument the app, curate or reuse a dataset, create a committed pytest eval suite, run evals, and iterate on failures.

Workflow Summary

  1. Inspect the target app and existing DeepEval usage.
  2. Ask the required intake questions.
  3. Reuse existing metrics and datasets when available.
  4. Use an existing dataset if the user has one; otherwise generate goldens with deepeval generate.
  5. Prefer native DeepEval integrations, then add minimal tracing add-ons.
  6. Run deepeval test run.
  7. Iterate for the requested number of rounds, defaulting to 5.

Core Principles

Installs
58
GitHub Stars
15.4K
First Seen
Apr 28, 2026