skill-eval

Skill Eval

Runs a single end-to-end quality evaluation of a Skill's execution flow:

Generate test cases → Run the Skill → Grading → Benchmark aggregation → Present results
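The five stages above can be sketched roughly as follows. This is an illustration only, not the skill's actual implementation: `run_pipeline`, `EvalResult`, the `run_skill` and `grade` callables, the pass/fail grading model, and the pass-rate aggregate are all hypothetical. Only the `skill_name`, `evals`, `id`, and `prompt` field names come from the `evals.json` excerpt in this document.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable


@dataclass
class EvalResult:
    """Outcome of one test case (hypothetical structure)."""
    eval_id: int
    passed: bool


def run_pipeline(
    evals: dict,
    run_skill: Callable[[str], str],
    grade: Callable[[dict, str], bool],
) -> dict:
    """Run every eval once, grade it, and aggregate a pass rate."""
    results = []
    for case in evals["evals"]:
        output = run_skill(case["prompt"])   # stage 2: run the Skill
        passed = grade(case, output)         # stage 3: grading
        results.append(EvalResult(case["id"], passed))

    # Stage 4: aggregation. A plain pass rate is assumed here.
    pass_rate = mean(1.0 if r.passed else 0.0 for r in results) if results else 0.0
    return {
        "skill_name": evals["skill_name"],
        "pass_rate": pass_rate,
        "results": results,
    }


# Stage 1 stand-in: a hand-written test set instead of a generated one.
demo_evals = {
    "skill_name": "example-skill",
    "evals": [
        {"id": 1, "prompt": "ok task"},
        {"id": 2, "prompt": "bad task"},
    ],
}

# Stand-ins for the real Skill runner and grader.
report = run_pipeline(
    demo_evals,
    run_skill=lambda prompt: f"echo: {prompt}",
    grade=lambda case, output: "ok" in output,
)

# Stage 5: present results.
print(f"{report['skill_name']}: pass rate {report['pass_rate']:.0%}")
```

Passing the runner and grader in as callables keeps the sketch independent of how the Skill is actually invoked or scored, which this excerpt does not specify.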

Phase 1: Generate test cases

After confirming with the user, generate evals/evals.json:

{
  "skill_name": "example-skill",
  "evals": [
    {
      "id": 1,
      "prompt": "The user's task prompt",
First Seen
Apr 2, 2026