solution-iterator
Warn
Audited by Gen Agent Trust Hub on Mar 29, 2026
Risk Level: MEDIUMCOMMAND_EXECUTIONREMOTE_CODE_EXECUTIONDATA_EXFILTRATIONPROMPT_INJECTIONEXTERNAL_DOWNLOADS
Full Analysis
- [COMMAND_EXECUTION]: The skill frequently executes shell commands to interact with external CLI tools and run test suites. This includes calling
claude-codeandcodexwith dynamically constructed prompts, as well as running local test runners such ascargo test,npm test, andpython -m pyteston generated code. - [REMOTE_CODE_EXECUTION]: The skill implements a loop that generates code using external AI services and then executes that code locally to evaluate its correctness. Generated code is saved to the
solutions/directory and executed using system test tools, representing a dynamic execution flow where content generated by a remote service is run in the local environment. - [DATA_EXFILTRATION]: The skill transmits user-provided problem descriptions and test context to external AI services (
claude-codeorcodex) via their respective CLI interfaces for processing. While necessary for the skill's operation, this involves sending local project context to external domains. - [PROMPT_INJECTION]: The skill is susceptible to indirect prompt injection because it processes untrusted input (problem descriptions and test context) and uses it to drive code generation and local execution.
- Ingestion points: Problem descriptions and test context provided by the user in the initial setup phase.
- Boundary markers: None identified in the prompt construction logic to separate user data from instructions.
- Capability inventory: Full shell access for running CLIs and test runners, and filesystem access for reading/writing code and logs.
- Sanitization: No validation or sanitization of the generated code is performed before execution.
- [EXTERNAL_DOWNLOADS]: The
dashboard.htmlfile includes a reference to the Chart.js library hosted on a public CDN (jsdelivr or similar) to generate visual progress charts. This reference targets a well-known library and service.
Audit Metadata