behavior-first-planning
Pass
Audited by Gen Agent Trust Hub on Jun 20, 2026
Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
- [COMMAND_EXECUTION]: The skill requires the agent to execute shell commands to run test suites (e.g., using
pytest,cargo, orgo test) to verify the state of acceptance tests in Phase 2 and Phase 4. - [DYNAMIC_EXECUTION]: The workflow involves generating test code in the
acceptance-tests/directory and subsequently executing it. This is a standard part of the TDD process defined by the skill. - [INDIRECT_PROMPT_INJECTION]: The skill ingests user 'intents' to generate planning beads. It includes specific instructions for an 'adversarial dimension checklist' to identify and mitigate common vulnerabilities (e.g., untrusted strings, forgeable trust markers) in the resulting behaviors.
Audit Metadata