eval-outcomes


eval-outcomes — moved to Mount Olympus (2026-06-10)

Holdout scenario management (absorbed from scenario, ag-s43tg)

Author and manage holdout scenarios with the ao CLI. ao scenario add "<title>" creates a scenario in .agents/holdout/ with an ID of the form s-YYYY-MM-DD-NNN, a set of acceptance vectors, and a default satisfaction threshold of 0.8. ao scenario validate checks the holdout set's schema and link graph. Linked scenarios feed directive fitness through ao goals scenarios (see the /goals skill and docs/adr/ADR-0003).
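As a rough illustration of the ID format and default threshold described above, the following Python sketch builds and checks IDs of the form s-YYYY-MM-DD-NNN and compares a score against 0.8. It is not part of the ao CLI. The helper names, the three-digit sequence range, and the "score at or above threshold passes" rule are all assumptions made for illustration.

```python
import re
from datetime import date

# Assumed shape for IDs of the form s-YYYY-MM-DD-NNN (three-digit sequence).
SCENARIO_ID_RE = re.compile(r"s-(\d{4})-(\d{2})-(\d{2})-(\d{3})")
DEFAULT_THRESHOLD = 0.8  # default satisfaction threshold named in the text


def make_scenario_id(day: date, seq: int) -> str:
    """Build an ID such as s-2026-06-10-001 (hypothetical helper)."""
    if not 0 <= seq <= 999:
        raise ValueError("seq must fit in three digits")
    return f"s-{day.isoformat()}-{seq:03d}"


def is_valid_scenario_id(scenario_id: str) -> bool:
    """Check the ID shape and that the date part is a real calendar date."""
    m = SCENARIO_ID_RE.fullmatch(scenario_id)
    if m is None:
        return False
    year, month, day_of_month = (int(m.group(i)) for i in (1, 2, 3))
    try:
        date(year, month, day_of_month)
    except ValueError:
        return False
    return True


def is_satisfied(score: float, threshold: float = DEFAULT_THRESHOLD) -> bool:
    """Assumed rule: a scenario passes when its score meets the threshold."""
    return score >= threshold
```

How ao actually allocates sequence numbers or computes satisfaction is not stated here, so treat this only as a way to reason about the ID shape and the threshold.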

Absorbed skills (ag-s43tg)

  • scenario: author and manage holdout scenarios with measurable acceptance vectors and satisfaction scoring in .agents/holdout/ for behavioral validation.

This skill encodes independent-verdict machinery and now lives with the outer gate product. The canonical copy is ~/dev/mt-olympus/.claude/skills/eval-outcomes/SKILL.md; read and follow that file. This stub preserves fleet routing until the using-agentops catalog closer updates the registry (skill-prune Lane A, evidence/skill-prune-recon.md).

Installs: 8
Repository: boshu2/agentops
GitHub Stars: 399
First Seen: May 31, 2026