evaluation-harness

Installation
SKILL.md

Evaluation Harness

Build systematic evaluation frameworks for LLM applications.

Golden Dataset Format

Installs
8
GitHub Stars
2
First Seen
Feb 16, 2026
evaluation-harness — monkey1sai/openai-cli