skill-creator
Pass
Audited by Gen Agent Trust Hub on Apr 13, 2026
Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
- [COMMAND_EXECUTION]: The skill uses subprocess calls to execute its own helper scripts and the Claude CLI for evaluation and packaging tasks.
eval-viewer/generate_review.pyusessubprocess.runwithlsofto manage the review server port.scripts/run_eval.pyusessubprocess.Popento invoke theclaudeCLI for trigger testing.SKILL.mdinstructs the agent to run various Python scripts for benchmark aggregation, packaging, and optimization loops.- [EXTERNAL_DOWNLOADS]: The skill references standard web dependencies for its HTML viewer (Google Fonts, SheetJS CDN).
- [REMOTE_CODE_EXECUTION]: The skill utilizes subagents to execute test prompts and grading tasks, which is the intended functional design for its iterative improvement loop.
Audit Metadata