canned-responses-anthropic
Pass
Audited by Gen Agent Trust Hub on May 16, 2026
Risk Level: SAFE
Full Analysis
- [SAFE]: No security issues were detected. The skill consists of text-based instructions and templates for legal response management.
- [PROMPT_INJECTION]: No evidence of system prompt extraction, safety bypass instructions, or malicious role-play triggers was found. The instructions reinforce safety by requiring human review for legal communications.
- [DATA_EXFILTRATION]: No network operations, credential harvesting, or unauthorized sensitive file access patterns were detected. The skill instructions suggest maintaining templates in local settings without prescribing unsafe storage methods.
- [INDIRECT_PROMPT_INJECTION]: The skill processes untrusted external input (legal inquiries) to generate responses. While this represents a standard attack surface for indirect injection, the risk is mitigated by the skill's extensive list of 'Escalation Triggers' that require the agent to stop and alert the user for review when sensitive content or high-risk situations (like litigation or regulatory investigations) are detected.
- [REMOTE_CODE_EXECUTION]: No remote code execution patterns or external script downloads were identified. The skill does not perform any package installations.
Audit Metadata