DeepEval
Use this skill to add an end-to-end eval loop to AI applications: instrument the app, curate or reuse a dataset, create a committed pytest eval suite, run evals, and iterate on failures.
Workflow Summary
- Inspect the target app and existing DeepEval usage.
- Ask the required intake questions.
- Reuse existing metrics and datasets when available.
- Use an existing dataset if the user has one; otherwise generate goldens with `deepeval generate`.
- Prefer native DeepEval integrations, then add minimal tracing add-ons.
- Run `deepeval test run`.
- Iterate for the requested number of rounds, defaulting to 5.
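
The run-and-iterate steps can be sketched as a small driver script. This is a minimal illustration, not part of DeepEval: `build_command`, `run_once`, `iterate`, and the path `tests/test_app_eval.py` are hypothetical names, and the sketch assumes that `deepeval test run` exits with a non-zero code when any eval fails.

```python
from __future__ import annotations

import subprocess
from typing import Callable, Sequence

DEFAULT_ROUNDS = 5  # default round budget stated in the workflow


def build_command(test_path: str) -> list[str]:
    """Build the argv for one eval run (test_path is a hypothetical file)."""
    return ["deepeval", "test", "run", test_path]


def run_once(cmd: Sequence[str]) -> int:
    """Run the command and return its exit code.

    Assumption: exit code 0 means every eval passed.
    """
    return subprocess.run(list(cmd), check=False).returncode


def iterate(
    test_path: str,
    rounds: int = DEFAULT_ROUNDS,
    runner: Callable[[Sequence[str]], int] = run_once,
) -> tuple[bool, int]:
    """Re-run the suite until it passes or the round budget is spent.

    Returns (passed, rounds_used).
    """
    cmd = build_command(test_path)
    for round_no in range(1, rounds + 1):
        if runner(cmd) == 0:
            return True, round_no
        # Between rounds, failures would be inspected and the app adjusted.
    return False, rounds


if __name__ == "__main__":
    passed, used = iterate("tests/test_app_eval.py")
    print(f"passed={passed} after {used} round(s)")
```

The `runner` parameter is injectable so the loop can be exercised without invoking the CLI; in practice the step between rounds (fixing prompts, retrieval, or metrics) is the part that needs human or agent judgment.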