Agent Evaluation Skill

Objective, evidence-based quality assessment for agents and skills. Implements a 6-phase rubric: Identify, Structural, Content, Code, Integration, Report. Every finding must cite a file path and line number — no subjective "looks good" verdicts.

Reference Loading Table

Signal	Load These Files	Why
tasks related to this reference	`batch-evaluation.md`	Loads detailed guidance from `batch-evaluation.md`.
tasks related to this reference	`common-issues.md`	Loads detailed guidance from `common-issues.md`.
tasks related to this reference	`report-templates.md`	Loads detailed guidance from `report-templates.md`.
tasks related to this reference	`scoring-rubric.md`	Loads detailed guidance from `scoring-rubric.md`.

Instructions

Phase 1: Identify Evaluation Targets

Goal: Determine what to evaluate and confirm targets exist.

Related skills

More from notque/claude-code-toolkit

Installs

Repository

notque/claude-c…-toolkit

GitHub Stars

366

First Seen

Mar 23, 2026

Security Audits

Gen Agent Trust HubPass

SocketPass

SnykPass

agent-evaluation

Agent Evaluation Skill

Reference Loading Table

Instructions

Phase 1: Identify Evaluation Targets

More from notque/claude-code-toolkit

generate-claudemd

fish-shell-config

pptx-generator

codebase-overview

image-to-video

data-analysis