nat-evaluation

Installation
SKILL.md

NeMo Agent Toolkit Evaluation

Use this skill for measuring agent quality and behavior.

Workflow

  1. Decide the evaluation surface and output format.
  2. Decompose quality goals into separate evaluators.
  3. Choose built-in evaluators before writing custom evaluators.
  4. Keep datasets small and explicit for local validation.
  5. Run nat eval and inspect generated artifacts.

References

Installs
1
GitHub Stars
2.4K
First Seen
Jun 10, 2026
nat-evaluation — nvidia/nemo-agent-toolkit