creating-eval-scenarios
Pass
Audited by Gen Agent Trust Hub on Jun 20, 2026
Risk Level: SAFECOMMAND_EXECUTIONPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill uses the 'tessl' command-line interface to execute evaluation runs and check their status (e.g., 'tessl eval run', 'tessl eval view-status', 'tessl eval list').
- [PROMPT_INJECTION]: The skill is susceptible to indirect prompt injection as it ingests and processes content from external SKILL.md and reference files to generate test scenarios.
- Ingestion points: 'SKILL.md', 'references/', and 'scripts/' directories within the target tile directory.
- Boundary markers: No explicit delimiters or instructions to ignore embedded commands are present in the analysis logic.
- Capability inventory: The skill has file system access (read/write) and shell execution capabilities via the 'tessl' CLI.
- Sanitization: No sanitization or escaping of the content extracted from analyzed files is mentioned in the workflow.
Audit Metadata