autonomous-skill
Fail
Audited by Gen Agent Trust Hub on Mar 29, 2026
Risk Level: HIGHREMOTE_CODE_EXECUTIONCOMMAND_EXECUTIONDATA_EXFILTRATIONPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The
run-session.shscript executes thecodexcommand, which allows the AI to run arbitrary shell commands on the host system to complete tasks. While this is the core feature of the skill, it presents a significant security risk if the AI-generated commands are malicious. - [REMOTE_CODE_EXECUTION]: The skill uses the Codex model to generate and execute code autonomously. The loop mechanism in
run-session.shallows for continuous, multi-session execution of remote-generated logic without manual review between steps. - [DATA_EXFILTRATION]: The
--networkflag inrun-session.shenables the--dangerously-bypass-approvals-and-sandboxoption for the Codex CLI. This explicitly disables platform security controls and allows the autonomous agent to communicate with external servers, facilitating the exfiltration of sensitive project data or environment variables. - [PROMPT_INJECTION]: The skill processes untrusted user input (the task description) and interpolates it directly into the system prompts for the 'Initializer' and 'Executor' agents.
- Ingestion points: The
task_descargument inrun-session.shis passed directly into the AI prompt. - Boundary markers: The prompts do not use robust delimiters or instructions to prevent the user description from overriding the agent's core safety guidelines or workflow logic.
- Capability inventory: The agents have access to
codex exec, file system writes, and optional network access. - Sanitization: No sanitization is performed on the task description content to prevent adversarial prompt injection.
- [CREDENTIALS_UNSAFE]: The execution templates (
executor-prompt.md) instruct the agent to runls -laandcatfiles to 'get its bearings'. This pattern makes it easy for a compromised agent or a malicious task description to discover and read sensitive files like.envor.sshconfigurations.
Recommendations
- AI detected serious security threats
Audit Metadata