evidence-driven-agent-rules

Pass

Audited by Gen Agent Trust Hub on May 18, 2026

Risk Level: SAFEPROMPT_INJECTIONCOMMAND_EXECUTION
Full Analysis
  • [PROMPT_INJECTION]: The skill has a surface for indirect prompt injection as it reads and analyzes failure logs authored by users.
  • Ingestion points: The promote workflow reads multiple files matching docs/reflection-log/[0-9]*.md to identify recurring patterns.
  • Boundary markers: No specific boundary markers or 'ignore' instructions are used when interpolating log content into the agent's context during rule generation.
  • Capability inventory: The skill can execute grep via the shell and has the authority to write to AGENTS.md, git hooks, and CI configurations.
  • Sanitization: Content from the logs is not sanitized; however, the skill explicitly instructs the agent to ask for confirmation before writing any changes.
  • [COMMAND_EXECUTION]: The skill utilizes shell commands, primarily grep, for pattern matching across log files. It also includes a developer utility script, evals/run-static-checks.sh, which executes standard development tools like git, python3, and readlink to validate the skill structure and schema compliance.
Audit Metadata
Risk Level
SAFE
Analyzed
May 18, 2026, 09:47 PM
Security Audit — agent-trust-hub — evidence-driven-agent-rules