code-testing

Installation
SKILL.md

Code testing

Three modes. Pick by task:

  • Building or fixing → Test-first loop
  • Reviewing or auditing existing tests → Desiderata review
  • Deciding strategy or test level → Choosing the test level

All three share the same philosophy.

Philosophy

Core principle: Tests should verify behavior through public interfaces, not implementation details. Code can change entirely; tests shouldn't.

Good tests are integration-style: they exercise real code paths through public APIs. They describe what the system does, not how it does it. A good test reads like a specification — "user can checkout with valid cart" tells you exactly what capability exists. These tests survive refactors because they don't care about internal structure.

Bad tests are coupled to implementation. They mock internal collaborators, test private methods, or verify through external means (like querying a database directly instead of using the interface). The warning sign: your test breaks when you refactor, but behavior hasn't changed. If you rename an internal function and tests fail, those tests were testing implementation, not behavior.

See tests.md for examples and mocking.md for mocking guidelines.

Installs
4
GitHub Stars
4
First Seen
Jun 3, 2026
code-testing — nweii/agent-stuff