eval-relevance
Installation
SKILL.md
Eval Relevance
Use this skill to evaluate how relevant an assistant response is to the user’s request.
Inputs
Require:
- The assistant response text to evaluate.
- (Optional) The user’s original request for comparison.
Internal Rubric (1–5)
5 = Directly addresses the user’s request, stays fully on-topic, and prioritizes what the user actually asked
4 = Mostly relevant, minor digressions or small omissions
3 = Partially relevant, addresses the general topic but misses key parts of the request
2 = Weak relevance, significant digressions or failure to address the core request
1 = Not relevant, does not address the user’s request or answers a different question entirely