waza-runner

Installation
SKILL.md

Skill Eval Runner

Evaluate Agent Skills like you evaluate AI Agents

This skill runs evaluations on other skills to measure their effectiveness using the same patterns that power AI agent evaluations.

When to Use

  • Running quality evaluations on a skill
  • Testing if a skill triggers on correct prompts
  • Measuring skill behavior quality
  • Generating eval reports for CI/CD

Commands

Run Evals

Run evals on <skill-name>
Installs
3
Repository
microsoft/waza
GitHub Stars
987
First Seen
May 13, 2026
waza-runner — microsoft/waza