step-back
Pass
Audited by Gen Agent Trust Hub on May 13, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: No malicious patterns, obfuscation, or unauthorized access attempts were detected in the skill's instructions or evaluation prompts.
- [PROMPT_INJECTION]: The skill requires the agent to ingest and reflect upon previous conversation history. While this is an indirect prompt injection surface where external input could attempt to influence the agent's logic, the instructions include explicit safeguards such as 'No false alarms' and 'No false validation' to ensure the agent remains objective and evidence-based.
- [COMMAND_EXECUTION]: The skill instructs the agent to create and save a local Markdown file (e.g.,
step-back-YYYY-MM-DD-HHMM.md). This is a routine file system operation used to provide the user with a persistent record of the reflection and does not target sensitive system paths or configuration files.
Audit Metadata