apastra-baseline

Installation
SKILL.md

Apastra Baseline

Establish baselines from evaluation runs. A baseline is a snapshot of a scorecard that represents "known good" — future evaluations compare against it to detect regressions.

When to Use

Use this skill when you want to:

  • Establish the first baseline after running an initial evaluation
  • Update the baseline after a prompt improvement has been verified
  • Roll back to a prior baseline

Establishing a Baseline

When asked to establish a baseline (e.g., "set the current results as the baseline for summarize-smoke"):

Step 1: Locate the Scorecard

Find the most recent run for the target suite in promptops/runs/. Look for the latest directory matching <suite-id>-* and read its scorecard.json.

Related skills
Installs
5
First Seen
Mar 12, 2026