skill-creator
Pass
Audited by Gen Agent Trust Hub on May 16, 2026
Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill executes local shell commands to support its development and testing workflow.
- Evidence in
scripts/run_eval.py: Usessubprocess.Popento call theclaudeCLI with user-provided queries to test skill triggering thresholds. - Evidence in
eval-viewer/generate_review.py: Usessubprocess.runto execute thelsofutility for port management when starting the local feedback server. - [PROMPT_INJECTION]: The skill contains an indirect prompt injection surface due to the ingestion of external data during the optimization loop.
- Ingestion points: Processes test cases from
eval_set.json(ingested inscripts/run_loop.py) and qualitative feedback fromfeedback.json(as described in theSKILL.mdworkflow). - Boundary markers: The skill does not implement explicit delimiters or instructional guarding when interpolating these external strings into agent prompts.
- Capability inventory: The skill has the ability to execute shell commands via the CLI, write files to the local directory (to update
SKILL.md), and spawn subagents for autonomous evaluation. - Sanitization: Input data from the evaluation and feedback files is used directly without sanitization or structural validation against injection patterns.
Audit Metadata