reliability-improvement-plan

Reliability Improvement Plan

Step 1: Gather Context

Ask the user:

What workload would you like me to assess for reliability? Please share:

  • Workload name and code packages/directories to analyze
  • Availability target (99.9%, 99.95%, 99.99%, etc.)
  • Recovery objectives (RTO and RPO if defined)
  • Past incidents (optional — recent outages or near-misses)

If the user has already provided this context, or you are working in a codebase that contains infrastructure as code (IaC), skip the questions and proceed directly to Step 2.
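An availability target is easier to reason about as a downtime budget. The following is a minimal Python sketch of that arithmetic, assuming a 365-day year and availability measured over a full year; the function name `downtime_budget_minutes` is illustrative and not part of the skill.

```python
# Assumption: a 365-day year; leap years and measurement windows are ignored.
MINUTES_PER_YEAR = 365 * 24 * 60


def downtime_budget_minutes(availability_percent: float) -> float:
    """Return the allowed downtime per year, in minutes, for an availability target."""
    if not 0 <= availability_percent <= 100:
        raise ValueError("availability must be between 0 and 100")
    return MINUTES_PER_YEAR * (1 - availability_percent / 100)


if __name__ == "__main__":
    # Print the yearly downtime budget for the example targets listed in Step 1.
    for target in (99.9, 99.95, 99.99):
        minutes = downtime_budget_minutes(target)
        print(f"{target}% -> {minutes:.1f} minutes/year")
```

Under these assumptions, 99.9% allows roughly 8.76 hours of downtime per year, while 99.99% allows under an hour, which is why the target strongly shapes how aggressive the RTO needs to be.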

Step 2: Fault Tolerance Discovery

Analyze infrastructure for single points of failure.
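One way to picture this analysis is as a pass over a resource inventory that flags anything running as a single instance or in a single Availability Zone. The sketch below is hypothetical: the `Resource` type, its fields, and the two checks are assumptions made for illustration, not the skill's actual discovery logic, and a real assessment would derive the inventory from the IaC templates.

```python
from dataclasses import dataclass


@dataclass
class Resource:
    """Hypothetical inventory entry; the fields are illustrative assumptions."""
    name: str
    instance_count: int
    availability_zones: tuple[str, ...]


def find_single_points_of_failure(resources):
    """Return (name, reasons) pairs for resources that lack redundancy."""
    findings = []
    for resource in resources:
        reasons = []
        if resource.instance_count < 2:
            reasons.append("only one instance")
        if len(set(resource.availability_zones)) < 2:
            reasons.append("deployed in a single Availability Zone")
        if reasons:
            findings.append((resource.name, reasons))
    return findings


if __name__ == "__main__":
    # Hypothetical inventory; names and zones are made up for the example.
    inventory = [
        Resource("orders-db", 1, ("us-east-1a",)),
        Resource("web-tier", 3, ("us-east-1a", "us-east-1b")),
    ]
    for name, reasons in find_single_points_of_failure(inventory):
        print(f"{name}: {', '.join(reasons)}")
```

These two checks cover only the most obvious cases; shared dependencies such as a single NAT gateway, DNS provider, or deployment pipeline would need their own rules.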

First Seen
May 29, 2026
reliability-improvement-plan — aws-samples/sample-well-architected-skills-and-steering