explaining-machine-learning-models

Pass

Audited by Gen Agent Trust Hub on Apr 30, 2026

Risk Level: SAFE
Full Analysis
  • [SAFE]: No malicious patterns or security vulnerabilities were identified in the analyzed files.
  • [PROMPT_INJECTION]: The skill processes user-provided datasets and model predictions for interpretability analysis. This presents a potential surface for indirect prompt injection. 1. Ingestion points: External datasets and model files (e.g., SKILL.md instructions). 2. Boundary markers: None explicitly defined for data interpolation in the provided templates. 3. Capability inventory: Bash, Write, and Edit tools are available. 4. Sanitization: No input sanitization logic is specified. However, this functionality is central to the skill's purpose and no malicious patterns were observed.
  • [DATA_EXFILTRATION]: No hardcoded credentials or unauthorized network operations were found in the static analysis.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 30, 2026, 08:34 PM
Security Audit — agent-trust-hub — explaining-machine-learning-models