skill-autoresearch

Installation
SKILL.md

Skill Autoresearch

Use this skill to improve another skill through measured iteration instead of gut feel.

The job is simple: run the target skill on a small test set, score outputs with binary evals, change one thing in the prompt, and keep only mutations that improve the score. Repeat until the score plateaus, the budget cap is hit, or the user stops the loop.

When to use this skill

  • A skill works inconsistently and needs a repeatable improvement loop
  • You want to benchmark a SKILL.md before editing it
  • You need binary evals for prompt or skill quality
  • You want a mutation log instead of ad-hoc rewriting
  • You want to compare baseline vs improved prompt behavior

Required inputs

Do not start experiments until all inputs below are known:

Related skills
Installs
6
GitHub Stars
1
First Seen
Mar 21, 2026