result-diagnosis

Installation
SKILL.md

Result Diagnosis

Diagnose what an experiment result means for the project. This skill is for decision-making after results exist, especially when they are negative, surprising, unstable, or hard to interpret.

Use this skill when:

  • a method does not improve over baseline
  • results vary strongly across seeds
  • a metric improves but another metric worsens
  • a baseline unexpectedly wins
  • a plot or table looks suspicious
  • a result may be caused by an implementation bug, metric bug, data issue, or unfair comparison
  • early experiments suggest revising the algorithm or paper claim
  • the user asks "what does this result mean?" or "what should we do next?"

Do not use this skill to write a polished report. Pair it with experiment-report-writer after the diagnosis is clear.

Pair this skill with:

Related skills
Installs
27
GitHub Stars
4
First Seen
11 days ago