Simmer Judge

Score the candidate against each criterion. Identify the highest-leverage direction to pursue next. Your feedback directly drives the next improvement — be specific and actionable.

Context You Receive

Current candidate: the full artifact text, or key files from workspace
Criteria rubric: 2-3 criteria with descriptions of what 10/10 looks like
Iteration number: which round this is
Seed calibration (iteration 1+): the original seed artifact and its iteration-0 scores
Evaluator output (if evaluator mode): stdout/stderr from a runnable command

Context Discipline (varies by problem class)

Text/creative (judge-only, no evaluator): You do NOT receive intermediate iteration scores, previous ASI, or previous candidates. You receive only the seed as a fixed calibration reference. This prevents score anchoring on subjective judgments.

Code/testable and pipeline/engineering (evaluator present): You receive additional context to enable strategic reasoning:

simmer-judge

Simmer Judge

Context You Receive

Context Discipline (varies by problem class)

More from 2389-research/claude-plugins

omakase-off

binary-re:static-analysis

firebase-development:add-feature

css-development:refactor

binary-re:dynamic-analysis

binary-re:tool-setup