self-improving-agent
Pass
Audited by Gen Agent Trust Hub on Mar 15, 2026
Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill provides several shell scripts for task automation.
activator.shanderror-detector.share used within agent hooks to monitor and report on session status, whileextract-skill.shautomates the creation of new skill directories and files from templates. Theextract-skill.shscript includes security checks to prevent directory traversal attacks. - [PROMPT_INJECTION]: The skill's primary function is to modify agent behavior over time by 'promoting' logged learnings into persistent context files like
CLAUDE.mdorSOUL.md. It also utilizes hook scripts to inject guidance reminders directly into the agent's prompt during active sessions. This creates a surface for indirect prompt injection: - Ingestion points: Tool outputs are captured via
error-detector.sh, and user corrections are ingested through prompt interactions. - Boundary markers: Logged data is organized using markdown headers but lacks explicit safety delimiters to prevent the agent from potentially obeying instructions embedded within the logged data.
- Capability inventory: The skill allows the agent to read/write local files and execute the provided helper scripts.
- Sanitization: While
extract-skill.shvalidates directory names, the actual content of the error and learning logs is not sanitized for executable instructions or prompt injection patterns.
Audit Metadata