eval-relevance

Installation
SKILL.md

Eval Relevance

Use this skill to evaluate how relevant an assistant response is to the user’s request.

Inputs

Require:

  • The assistant response text to evaluate.
  • (Optional) The user’s original request for comparison.

Internal Rubric (1–5)

5 = Directly addresses the user’s request, stays fully on-topic, and prioritizes what the user actually asked
4 = Mostly relevant, minor digressions or small omissions
3 = Partially relevant, addresses the general topic but misses key parts of the request
2 = Weak relevance, significant digressions or failure to address the core request
1 = Not relevant, does not address the user’s request or answers a different question entirely

Workflow

Installs
4
First Seen
Feb 19, 2026
eval-relevance — whitespectre/ai-assistant-evals