evaluating-with-promptfoo

Fail

Audited by Snyk on Mar 29, 2026

Risk Level: CRITICAL
Full Analysis

CRITICAL E006: Malicious code pattern detected in skill scripts.

  • Malicious code pattern detected (high risk: 0.90). The content documents an offensive, dual-use red‑teaming toolkit with multiple explicit features that enable intentional abuse: data‑exfiltration plugins and poisoned‑RAG tooling, jailbreak/prompt‑injection strategies, HTTP/custom providers that can send environment secrets to arbitrary endpoints, arbitrary JS/Python hooks/assertions and file:// providers that allow remote code execution, and an MCP server/CI integrations that can be abused as a remote execution/backdoor vector.

MEDIUM W011: Third-party content exposure detected (indirect prompt injection risk).

  • Third-party content exposure detected (high risk: 0.90). The skill's documentation (INSTRUCTIONS.md and references/PROVIDERS.md) explicitly supports ingesting external, arbitrary HTTP endpoints (providers with config.url), Google Sheets and other external data sources, and RAG/context extraction plus red-team strategies like "indirect-web-pwn" and "indirect-prompt-injection" — all of which indicate the agent will read and act on untrusted third-party content that can alter subsequent transforms, assertions, and test flows.

Issues (2)

E006
CRITICAL

Malicious code pattern detected in skill scripts.

W011
MEDIUM

Third-party content exposure detected (indirect prompt injection risk).

Audit Metadata
Risk Level
CRITICAL
Analyzed
Mar 29, 2026, 11:13 PM
Issues
2
Security Audit — snyk — evaluating-with-promptfoo