verification-loops
Pass
Audited by Gen Agent Trust Hub on Apr 12, 2026
Risk Level: SAFEPROMPT_INJECTION
Full Analysis
- [PROMPT_INJECTION]: The
LLMJudgeGraderimplementation inSKILL.mdpresents a surface for indirect prompt injection by interpolating untrusted agent outputs directly into an evaluation prompt without delimiters.\n - Ingestion points: The
outputparameter in theevaluatemethod ofLLMJudgeGrader.\n - Boundary markers: Absent; the agent output is concatenated directly into the
JUDGE_PROMPTstring template.\n - Capability inventory: The result of this grader determines the control flow of the agent pipeline (acceptance or rejection of generated content).\n
- Sanitization: None; the skill provides a conceptual implementation without input escaping or validation.
Audit Metadata