enhance-skill-by-derailment
Warn
Audited by Gen Agent Trust Hub on May 10, 2026
Risk Level: MEDIUMEXTERNAL_DOWNLOADSREMOTE_CODE_EXECUTIONCOMMAND_EXECUTIONDATA_EXFILTRATION
Full Analysis
- [EXTERNAL_DOWNLOADS]: Fetches skill definitions and reference materials from user-specified GitHub repositories using the GitHub CLI (
gh api). - [REMOTE_CODE_EXECUTION]: Directly executes remote instructions by downloading them from external sources and passing them to a subagent for execution. This enables unvetted external content to control agent behavior within the test environment.
- [COMMAND_EXECUTION]: Employs several shell commands (including
gh,find,grep, andwc) and a Python one-liner to download files, extract execution traces from JSONL files, and verify the integrity of the skill structure. - [DATA_EXFILTRATION]: By executing untrusted remote instructions, the skill creates an attack surface where a malicious skill could instruct the subagent to exfiltrate sensitive data from the execution environment.
Audit Metadata