continuous-learning-v2
Warn
Audited by Gen Agent Trust Hub on May 19, 2026
Risk Level: MEDIUMDATA_EXFILTRATIONEXTERNAL_DOWNLOADSCOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
- [DATA_EXFILTRATION]: The skill records extensive session logs (tool inputs, outputs, and user prompts) into a local data directory. While it implements regex-based redaction for secrets, the comprehensive logging of developer activity is inherently sensitive. These logs are subsequently sent to an external large language model (Claude Haiku) for automated pattern analysis.
- [EXTERNAL_DOWNLOADS]: The
instinct-cli.pyscript contains an import feature that allows fetching behavioral definition files directly from arbitrary URLs usingurllib.request.urlopen. - [COMMAND_EXECUTION]: The system relies on executing various shell commands and subprocesses, including the
claudeCLI,git, and system utilities likenohupandpgrep, to manage its background learning loop and project detection. - [PROMPT_INJECTION]: The background observer agent is vulnerable to indirect prompt injection. It ingests untrusted data from
observations.jsonl(which contains raw session output) and is instructed to generate "instincts" based on this data. A malicious input during a session could influence the agent to write unauthorized or harmful instructions into the local behavior library. - Ingestion points:
observations.jsonl(processed byobserver-loop.shandobserver.md) - Boundary markers: None implemented within the data sampling phase to isolate observation data from agent instructions.
- Capability inventory: The background agent has
Writepermissions to the instincts directory andReadpermissions to project files. - Sanitization: Implements regex-based scrubbing for common secret patterns (API keys, tokens) before logging.
Audit Metadata