# Apastra Baseline
Establish baselines from evaluation runs. A baseline is a snapshot of a scorecard that represents "known good" — future evaluations compare against it to detect regressions.
## When to Use
Use this skill when you want to:
- Establish the first baseline after running an initial evaluation
- Update the baseline after a prompt improvement has been verified
- Roll back to a prior baseline
## Establishing a Baseline
When asked to establish a baseline (e.g., "set the current results as the baseline for summarize-smoke"):
### Step 1: Locate the Scorecard
Find the most recent run for the target suite in `promptops/runs/`. Look for the latest directory matching `<suite-id>-*` and read its `scorecard.json`.
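The lookup above can be sketched as a small helper. This is an illustration, not part of apastra itself; it assumes run directory names sort chronologically (e.g. a timestamp or incrementing suffix after the suite id):

```python
import json
from pathlib import Path


def latest_scorecard(runs_dir: str, suite_id: str) -> dict:
    """Read the scorecard.json of the most recent run for a suite.

    Illustrative helper (not an apastra API). Assumes directory names
    like <suite-id>-<run-suffix> sort chronologically by name.
    """
    runs = sorted(Path(runs_dir).glob(f"{suite_id}-*"))
    if not runs:
        raise FileNotFoundError(f"no runs found for suite {suite_id!r}")
    scorecard_path = runs[-1] / "scorecard.json"
    return json.loads(scorecard_path.read_text())
```

With a layout like `promptops/runs/summarize-smoke-0002/scorecard.json`, calling `latest_scorecard("promptops/runs", "summarize-smoke")` returns the parsed scorecard of the newest run, ready to be snapshotted as the baseline.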