os-skill-improvement

Installation
SKILL.md

Skill Continuous Improvement: RED-GREEN-REFACTOR

Adapts the RED-GREEN-REFACTOR cycle from software testing to skill authoring. The key insight: a skill is a testable contract. The failure to follow the contract is observable. Always observe the failure BEFORE writing the fix.

Integrated with:

  • os-eval-runner -- runs eval_runner.py as the GREEN verification step
  • os-learning-loop -- uses this methodology to gate every proposed skill patch
  • evals/evals.json + results.tsv -- autoresearch eval format for longitudinal tracking

The TDD Mapping

Software TDD Skill Authoring Equivalent
Test case Pressure scenario: a user prompt that should trigger the skill
RED phase Run a baseline WITHOUT the skill. Observe: does the agent violate the intended protocol?
Related skills
Installs
1
GitHub Stars
2
First Seen
Apr 3, 2026