run-codex-1
Fail
Audited by Gen Agent Trust Hub on May 19, 2026
Risk Level: HIGHCOMMAND_EXECUTIONDATA_EXFILTRATIONREMOTE_CODE_EXECUTIONPROMPT_INJECTION
Full Analysis
- [COMMAND_EXECUTION]: The skill's core configuration in
scripts/codex-flags.shenforces the--dangerously-bypass-approvals-and-sandboxflag for all managed operations. This removes the security sandbox of the orchestrated tool, granting the autonomous agent full filesystem access and unrestricted network egress as documented inreferences/universal/codex-flags.md. - [DATA_EXFILTRATION]: In
scripts/setup-worktree.sh, the skill automatically symlinks.env.local(and other environment files specified by theLINK_ENV_FILEvariable) from the primary repository into isolated worktrees. This exposes sensitive development credentials and secrets to the autonomous agents and any commands they execute. - [REMOTE_CODE_EXECUTION]: The
run-fleet.shscript (and its associated documentation inrun-fleet.md) implements an automatic 'post-verify' mechanism that detects and executes shell commands such aspnpm test,cargo check, orgo vetbased on files present in the repository. A malicious repository could exploit this to trigger arbitrary code execution during the automated fleet run. - [PROMPT_INJECTION]: The skill uses a 'SUBAGENT-STOP' instruction prefix in its templates (found in
references/templates/exec.tmpl.mdandreferences/templates/review.tmpl.md) designed to override the backend model's internal meta-skills and planning protocols. While intended for performance, this constitutes a deliberate bypass of the managed model's standard operational guidelines. - [DATA_EXFILTRATION]: By enabling unrestricted network egress via the sandbox bypass, the skill creates a path for potentially sensitive repository data or environment secrets to be sent to external endpoints during task execution.
- [COMMAND_EXECUTION]: The dispatcher in
scripts/run-codex-1.mjsuseschild_process.spawnwithdetached: trueto run its background workers. While functional for orchestration, this pattern reduces oversight of the commands being executed in the background.
Recommendations
- AI detected serious security threats
Audit Metadata