evalview-agent-testing

Installation
SKILL.md

EvalView Agent Testing

Automated regression testing for AI agents. EvalView snapshots your agent's behavior (tool calls, parameters, sequence, output), then diffs against the baseline after every change. When something breaks, you know immediately — before it ships.

When to Activate

  • After modifying agent code, prompts, or tool definitions
  • After a model update or provider change
  • Before deploying an agent to production
  • When setting up CI/CD for an agent project
  • When an autonomous loop (OpenClaw, coding agents) needs a fitness function
  • When agent output changes unexpectedly and you need to identify what shifted

Core Workflow

# 1. Set up
pip install "evalview>=0.5,<1"
evalview init              # Detect agent, create starter test suite
Related skills
Installs
1
GitHub Stars
179.7K
First Seen
Mar 31, 2026