incident-response

Installation
SKILL.md

Incident Response

When production breaks, speed and clarity matter more than perfection. Mitigate first, root-cause later.

Context

Incident response is the skill of managing production failures under pressure. It combines technical diagnosis with communication discipline. The goal is not to find the perfect fix immediately -- it's to stop the bleeding, then investigate properly.

In a lifecycle-aware system, incident response must preserve the release boundary that just shipped. Do not widen scope into redesign during the incident. For brownfield systems, prefer mitigations that fail closed, preserve coexistence, and protect data integrity even if the temporary user experience becomes narrower.

Inputs

  • ci-cd-pipeline -- produced by the preceding skill in the lifecycle
  • architecture-doc -- produced by the preceding skill in the lifecycle
  • service-alerts -- produced by the preceding skill in the lifecycle

Process

Step 1: Confirm the Incident and Current Boundary

Related skills

More from yknothing/prodcraft

Installs
6
First Seen
Mar 27, 2026