nat-evaluation
Installation
SKILL.md
NeMo Agent Toolkit Evaluation
Use this skill for measuring agent quality and behavior.
Workflow
- Decide the evaluation surface and output format.
- Decompose quality goals into separate evaluators.
- Choose built-in evaluators before writing custom evaluators.
- Keep datasets small and explicit for local validation.
- Run
nat evaland inspect generated artifacts.