waxa-eval
Installation
SKILL.md
waxa-eval
Empirical evaluation loop for skill prompts, codified from real iter runs.
This skill is the operating manual for waxa; the CLI itself lives in
tools/waxa/ (see its README for argument-level reference).
waxa extends microsoft/waza with
empirical-prompt-tuning semantics on top: forced Self-report, ledger
across iterations, four grader types, and convergence detection.
When to invoke
Explicit user request only:
- "evaluate this skill with waxa"
- "iterate on
<skill>until it converges" - "add a waxa scenario for
<scenario>" - "interpret these unclear-points / Self-report"