creating-eval-scenarios

Pass

Audited by Gen Agent Trust Hub on Jun 20, 2026

Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
  • [COMMAND_EXECUTION]: The skill uses the 'tessl' command-line interface to execute evaluation runs and check their status (e.g., 'tessl eval run', 'tessl eval view-status', 'tessl eval list').
  • [PROMPT_INJECTION]: The skill is susceptible to indirect prompt injection as it ingests and processes content from external SKILL.md and reference files to generate test scenarios.
  • Ingestion points: 'SKILL.md', 'references/', and 'scripts/' directories within the target tile directory.
  • Boundary markers: No explicit delimiters or instructions to ignore embedded commands are present in the analysis logic.
  • Capability inventory: The skill has file system access (read/write) and shell execution capabilities via the 'tessl' CLI.
  • Sanitization: No sanitization or escaping of the content extracted from analyzed files is mentioned in the workflow.
Audit Metadata
Risk Level
SAFE
Analyzed
Jun 20, 2026, 03:13 AM
Security Audit — agent-trust-hub — creating-eval-scenarios