The Agent Skills Directory

[PROMPT_INJECTION]: The skill processes external evaluation data, creating an indirect prompt injection surface.
Ingestion points: The scripts/evaluation.py script parses user-provided XML files to extract questions for the agent loop.
Boundary markers: The EVALUATION_PROMPT utilizes XML tags to structure the agent's output, but it does not provide robust isolation or delimiters for the untrusted question text from the XML.
Capability inventory: The skill can execute local shell commands (via scripts/connections.py) and perform network requests to the Anthropic API during evaluation tasks.
Sanitization: There is no evidence of input validation, escaping, or sanitization performed on the content of the <question> tags before it is interpolated into the agent prompt.
[COMMAND_EXECUTION]: The evaluation utility in scripts/evaluation.py and the connection logic in scripts/connections.py utilize subprocesses to execute local commands for MCP server communication when the stdio transport is used. This behavior is intended for testing local server implementations.

mcp-builder