Claude Flow CLI
Fail
Audited by Gen Agent Trust Hub on May 10, 2026
Risk Level: HIGHREMOTE_CODE_EXECUTIONEXTERNAL_DOWNLOADSCOMMAND_EXECUTIONPROMPT_INJECTIONDATA_EXFILTRATION
Full Analysis
- [REMOTE_CODE_EXECUTION]: The instructions mandate the use of
npx @claude-flow/cli@latestfor all operations. This pattern fetches and executes code directly from the NPM registry at runtime. Using the@latesttag ensures the tool always runs the most recent version, making the environment highly vulnerable to supply chain attacks if the package or the developer's account is compromised.\n- [EXTERNAL_DOWNLOADS]: The skill requires downloading the@claude-flow/clipackage from the public NPM registry, which is an external and untrusted source relative to the core agent platform.\n- [COMMAND_EXECUTION]: The skill is primarily a collection of shell commands designed to manage agents, swarms, and memory. These commands allow for significant control over the host environment, including spawning new processes and modifying system state.\n- [PROMPT_INJECTION]: The skill uses the name 'Claude Flow', which leverages the brand of a well-known AI model (Claude). Given that the author is 'ricable', this use of branding is potentially deceptive and constitutes metadata poisoning, as it may mislead users or agents into assuming the tool is an official Anthropic product.\n- [DATA_EXFILTRATION]: The presence of commands likememory export --file <path>andsession export --file <path>allows for the extraction of internal agent state and memory to external files. If directed by a malicious prompt, these tools could be used to harvest and exfiltrate sensitive data from the agent's context.\n- [PROMPT_INJECTION]: The skill presents an indirect prompt injection surface as it ingests untrusted data during task creation and memory storage.\n - Ingestion points:
task create --name <name>,memory store --value <value>, and swarm configuration inputs in SKILL.md.\n - Boundary markers: None are defined in the instructions to separate untrusted data from system commands.\n
- Capability inventory: The skill has extensive capabilities including shell command execution (
npx), file writing (export), and agent spawning across all described commands.\n - Sanitization: No sanitization or validation of the input data is described or implemented in the instructions.
Recommendations
- AI detected serious security threats
Audit Metadata