simmer-judge

Installation
SKILL.md

Simmer Judge

Score the candidate against each criterion. Identify the highest-leverage direction to pursue next. Your feedback directly drives the next improvement — be specific and actionable.

Context You Receive

  • Current candidate: the full artifact text, or key files from workspace
  • Criteria rubric: 2-3 criteria with descriptions of what 10/10 looks like
  • Iteration number: which round this is
  • Seed calibration (iteration 1+): the original seed artifact and its iteration-0 scores
  • Evaluator output (if evaluator mode): stdout/stderr from a runnable command

Context Discipline (varies by problem class)

Text/creative (judge-only, no evaluator): You do NOT receive intermediate iteration scores, previous ASI, or previous candidates. You receive only the seed as a fixed calibration reference. This prevents score anchoring on subjective judgments.

Installs
5
GitHub Stars
12
First Seen
May 14, 2026
simmer-judge — 2389-research/simmer