bkit-evals
Installation
SKILL.md
bkit Evals — Skill Quality Evaluation Runner
v2.1.11 Sprint β FR-β2. Wraps
evals/runner.jswith input validation, result persistence, and structured reporting. Replaces the barenode evals/runner.js <skill>invocation that previously required users to remember argv structure and ignored timeout / sandbox concerns.
Arguments
| Argument | Description | Example |
|---|---|---|
run <skill> |
Execute the eval suite for one skill | /bkit-evals run gap-detector |
list |
List all skills that have an eval.yaml definition |
/bkit-evals list |
If no argument is provided, render the same output as list.