skill-autoresearch
Skill Autoresearch
Use this skill to improve another skill through measured iteration instead of gut feel.
The job is simple: run the target skill on a small test set, score outputs with binary evals, change one thing in the prompt, and keep only mutations that improve the score. Repeat until the score plateaus, the budget cap is hit, or the user stops the loop.
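The loop described above can be sketched as a simple hill climb. This is a minimal illustration, not the skill's actual implementation: `score`, `autoresearch`, `run_skill`, `budget`, and `patience` are hypothetical names, the score is assumed to be the fraction of passing (test input, eval) pairs, and "plateau" is assumed to mean `patience` consecutive rejected mutations.

```python
from typing import Callable, List, Sequence, Tuple

BinaryEval = Callable[[str], bool]        # output -> pass/fail
RunSkill = Callable[[str, str], str]      # (prompt, test input) -> output
Mutation = Callable[[str], str]           # prompt -> prompt with one change


def score(prompt: str, run_skill: RunSkill,
          test_inputs: Sequence[str], evals: Sequence[BinaryEval]) -> float:
    """Fraction of (test input, eval) pairs that pass for this prompt."""
    results = [ev(run_skill(prompt, x)) for x in test_inputs for ev in evals]
    return sum(results) / len(results)


def autoresearch(baseline: str, mutations: Sequence[Mutation],
                 run_skill: RunSkill, test_inputs: Sequence[str],
                 evals: Sequence[BinaryEval],
                 budget: int = 20, patience: int = 3,
                 ) -> Tuple[str, float, List[Tuple[str, float, bool]]]:
    """Apply one mutation at a time; keep it only if the score improves."""
    best_prompt = baseline
    best_score = score(baseline, run_skill, test_inputs, evals)
    log = [("baseline", best_score, True)]   # mutation log: (name, score, kept)
    stale = 0                                # consecutive rejected mutations
    for i, mutate in enumerate(mutations):
        if i >= budget or stale >= patience:  # budget cap or plateau
            break
        candidate = mutate(best_prompt)
        s = score(candidate, run_skill, test_inputs, evals)
        kept = s > best_score                 # strict improvement only
        log.append((mutate.__name__, s, kept))
        if kept:
            best_prompt, best_score, stale = candidate, s, 0
        else:
            stale += 1
    return best_prompt, best_score, log


# Toy stand-ins so the sketch runs without a real skill or model.
def toy_skill(prompt: str, x: str) -> str:
    return f"{prompt} | {x}"

def add_cite(p: str) -> str:
    return p + " Cite sources."

def add_noise(p: str) -> str:
    return p + " Whatever."

def add_brief(p: str) -> str:
    return p + " Be brief."

toy_evals = [lambda out: "Cite sources." in out,
             lambda out: "Be brief." in out]

best_prompt, best_score, log = autoresearch(
    "Answer.", [add_cite, add_noise, add_brief],
    toy_skill, ["q1", "q2"], toy_evals)
```

In this toy run the `add_noise` mutation is logged but discarded because it does not raise the score, which is the behavior the mutation log is meant to capture.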
When to use this skill
- A skill works inconsistently and needs a repeatable improvement loop
- You want to benchmark a SKILL.md before editing it
- You need binary evals for prompt or skill quality
- You want a mutation log instead of ad-hoc rewriting
- You want to compare baseline vs improved prompt behavior
Required inputs
Do not start experiments until all inputs below are known: