waza-runner
Pass
Audited by Gen Agent Trust Hub on May 13, 2026
Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
- Configuration-Driven Execution: The skill parses configuration files to execute Python assertions and external scripts. While this constitutes dynamic execution, it is the primary function of the evaluation runner for testing skill performance.
- Input Processing: The skill ingests data from local task definitions and evaluation specs. Users should ensure these configuration files come from trusted sources as they determine the evaluation logic.
Audit Metadata