skill-creator
Pass
Audited by Gen Agent Trust Hub on Apr 21, 2026
Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
- Subprocess Command Execution: The skill includes utility scripts (
scripts/run_eval.py,scripts/run_loop.py,scripts/improve_description.py) that utilizesubprocess.run()andsubprocess.Popen()to execute theclaudeCLI and other shell commands. While these are used within the context of benchmarking and testing the created skills, users should ensure they are running these in a secure environment. - Subagent Orchestration: The instructions guide the agent to spawn subagents for parallel execution of test cases and grading. This is a core feature of the skill's design to enable objective evaluation and human-in-the-loop iteration.
- Local Web Server: The
eval-viewer/generate_review.pyscript starts a temporary local HTTP server (default port 3117) to serve a self-contained HTML review page. This is intended to facilitate human review of test results in environments that support browser access.
Audit Metadata