github-pr-reviewer
Pass
Audited by Gen Agent Trust Hub on Jun 24, 2026
Risk Level: SAFEPROMPT_INJECTIONCOMMAND_EXECUTION
Full Analysis
- [PROMPT_INJECTION]: The skill is vulnerable to indirect prompt injection because it processes untrusted pull request content (titles, descriptions, and comments) within an agent environment that has access to sensitive tools and credentials.
- Ingestion points: In
scripts/main.py, the_build_review_promptfunction interpolates the PR title and description directly into the prompt. Furthermore, the agent is instructed to usegh pr viewto inspect the PR's discussion and comments, which are attacker-controllable. - Boundary markers: The PR description is enclosed in triple-dash (
---) delimiters, which offer a weak structural separation but are not sufficient to prevent a model from following malicious instructions embedded in the description. - Capability inventory: The reviewer agent is equipped with the
terminalandfile_editortools and is granted access to theGITHUB_PERSONAL_ACCESS_TOKENvia thesecretspayload in thecreate_conversationcall. A successful indirect injection could lead to arbitrary command execution or token exfiltration. - Sanitization: The script does not perform sanitization, filtering, or validation on the PR content or comments before they are presented to the LLM.
- [COMMAND_EXECUTION]: The skill's setup process involves the agent generating and writing a Python script (
main.py) to the filesystem based on user input. While the instructions recommend usingjson.dumps()for safety, this pattern relies on the setup agent's adherence to instructions to prevent code injection during the customization phase.
Audit Metadata