citation-audit
Pass
Audited by Gen Agent Trust Hub on May 18, 2026
Risk Level: SAFEPROMPT_INJECTIONCOMMAND_EXECUTIONEXTERNAL_DOWNLOADS
Full Analysis
- [PROMPT_INJECTION]: The skill is susceptible to indirect prompt injection because it ingests untrusted data from LaTeX files and bibliography databases and interpolates it directly into prompts for a secondary AI model (gpt-5.5 via mcp__codex__codex).
- Ingestion points: Extracts citation contexts from
*.texfiles and bibliographic metadata from*.bibfiles (Steps 1 and 2). - Boundary markers: The prompt template uses Markdown headers (e.g.,
## Where this entry is cited) but lacks explicit instructions to the reviewer model to ignore or escape instructions embedded within the interpolated paper content. - Capability inventory: The skill environment has access to powerful tools including
mcp__codex__codex,WebSearch,WebFetch,Bash, andEdit. - Sanitization: There is no evidence of sanitization or input validation performed on the paper content before it is sent to the external reviewer or used to generate file-write operations.
- [COMMAND_EXECUTION]: The skill executes
latexmkon the audited paper directory. If the paper contains malicious TeX macros (such as\write18or other shell-escape commands), it could lead to arbitrary command execution on the host system during the verification process. - [EXTERNAL_DOWNLOADS]: The skill uses
WebSearchandWebFetchto interact with external databases like DBLP and arXiv. While these are reputable sources, the interactions are driven by untrusted citation keys extracted from the user's files.
Audit Metadata