werewolf-judge

Pass

Audited by Gen Agent Trust Hub on Apr 6, 2026

Risk Level: SAFECOMMAND_EXECUTION
Full Analysis
  • [COMMAND_EXECUTION]: The engine.py script provides a command-line interface for the agent to manage game cycles, such as initialization and phase transitions, which is a standard and safe use of CLI tools.
  • [DATA_EXPOSURE_AND_EXFILTRATION]: The skill allows the configuration of an audit log path via the --audit flag and creates local directories for storage. This operation is restricted to the local filesystem and used for legitimate game history logging without exposing sensitive system data or credentials.
  • [INDIRECT_PROMPT_INJECTION]: The skill processes player-provided data through JSON payloads, creating a surface for potential indirect injection if malicious instructions are embedded in player text.
  • Ingestion points: Player replies are ingested as JSON payloads via the --payload argument in engine.py.
  • Boundary markers: Structured JSON parsing is used, and the skill documentation emphasizes protocol validation.
  • Capability inventory: Local filesystem directory creation and file writing for audit logs.
  • Sanitization: Input is parsed using json.loads and validated by the referenced werewolf_core package.
Audit Metadata
Risk Level
SAFE
Analyzed
Apr 6, 2026, 02:27 PM