experiment-engine
Pass
Audited by Gen Agent Trust Hub on Jun 22, 2026
Risk Level: SAFE, COMMAND_EXECUTION
Full Analysis
- [COMMAND_EXECUTION]: The skill utilizes shell commands for build automation, testing, and performance measurement.
  - Evidence: Use of `npx next build`, `npx jest`, and `curl` to gather metrics.
  - Evidence: Systematic use of Git commands (`git stash`, `git checkout .`, `git commit`) for local state management and safe rollback of experiments.
- [SAFE]: The skill describes a legitimate software engineering workflow for performance and quality optimization.
  - The references to other agents (profiler, nitro, verifier) and the `experiment-loop` skill are part of the intended ecosystem integration.
  - Code modification is performed via an LLM tool (`claude`) with the explicit goal of optimization, following a defined protocol with a human-verifiable rollback strategy.
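
For readers unfamiliar with the pattern, the keep-or-rollback cycle that the evidence describes might look roughly like the sketch below. It is an illustration only, not the skill's actual code: the function `run_experiment`, its parameters, the "lower is better" metric, and the commit message are all assumptions made for this example. Only the commands `npx jest`, `git commit`, and `git checkout .` come from the audit findings.

```python
import subprocess
from typing import Callable, Sequence, Tuple


def shell(cmd: Sequence[str]) -> int:
    """Run a command and return its exit code."""
    return subprocess.run(list(cmd)).returncode


def run_experiment(
    apply_change: Callable[[], None],   # hypothetical: edits files in the working tree
    measure: Callable[[], float],       # hypothetical: returns a metric, lower is better
    baseline: float,
    run: Callable[[Sequence[str]], int] = shell,
    message: str = "experiment: keep improvement",  # assumed commit message
) -> Tuple[bool, float]:
    """Apply a change; keep it only if tests pass and the metric improves."""
    apply_change()

    # Gate 1: the test suite must pass.
    if run(["npx", "jest"]) == 0:
        score = measure()
        # Gate 2: the measured metric must beat the baseline.
        if score < baseline:
            run(["git", "commit", "-am", message])
            return True, score

    # Either gate failed: discard uncommitted edits to tracked files.
    run(["git", "checkout", "."])
    return False, baseline
```

Two caveats on this sketch: `git checkout .` restores tracked files only, so files newly created by a change would remain, and `git commit -am` likewise commits tracked files only. The `run` parameter is injectable so the loop can be exercised without touching a real repository.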
Audit Metadata