security-hardening

Pass

Audited by Gen Agent Trust Hub on Apr 18, 2026

Risk Level: SAFE
Full Analysis
  • [PROMPT_INJECTION]: The skill includes detection patterns for prompt injection techniques such as 'ignore instructions' and 'DAN' mode. These are used strictly for security filtering and documentation purposes.
  • [COMMAND_EXECUTION]: The skill provides defensive command interception logic to prevent non-admin users from executing high-risk operations. The setup script generates these components locally.
  • [DATA_EXFILTRATION]: The skill redacts sensitive information from the agent's output and logs security events locally. No network-based exfiltration was identified.
  • [SAFE]: The skill is a legitimate security tool designed to improve the safety posture of the agent environment.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 18, 2026, 01:11 AM
Security Audit — agent-trust-hub — security-hardening