Google Site Reliability Engineering (SRE)
Overview
Site Reliability Engineering (SRE) is Google's approach to running production systems. It applies software engineering principles to operations, treating reliability as a feature that can be measured, budgeted, and engineered.
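To make "measured, budgeted" concrete, here is a minimal sketch of the error-budget arithmetic commonly associated with SRE: the budget is the share of requests that may fail while still meeting a Service Level Objective (SLO). The class name `ErrorBudget`, its fields, and the sample numbers are hypothetical and chosen only for illustration; they are not taken from the book or from any Google tooling.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ErrorBudget:
    """Illustrative error-budget calculation for a request-based SLO.

    All names here are assumptions made for this sketch.
    """

    slo_target: float      # e.g. 0.999 means 99.9% of requests should succeed
    total_requests: int    # requests served in the measurement window
    failed_requests: int   # requests that violated the SLO in that window

    def __post_init__(self) -> None:
        # A target of exactly 1.0 leaves no budget at all, so reject it here.
        if not 0.0 < self.slo_target < 1.0:
            raise ValueError("slo_target must be strictly between 0 and 1")
        if self.total_requests <= 0:
            raise ValueError("total_requests must be positive")
        if not 0 <= self.failed_requests <= self.total_requests:
            raise ValueError("failed_requests must be within [0, total_requests]")

    @property
    def availability(self) -> float:
        """Measured success ratio (the 'measured' part)."""
        return 1.0 - self.failed_requests / self.total_requests

    @property
    def allowed_failures(self) -> float:
        """Failures the SLO tolerates in this window (the 'budgeted' part)."""
        return (1.0 - self.slo_target) * self.total_requests

    @property
    def remaining_fraction(self) -> float:
        """Share of the budget still unspent; negative once it is exhausted."""
        return 1.0 - self.failed_requests / self.allowed_failures


if __name__ == "__main__":
    # Hypothetical window: one million requests, 250 failures, 99.9% target.
    budget = ErrorBudget(slo_target=0.999, total_requests=1_000_000, failed_requests=250)
    print(f"availability:       {budget.availability:.5f}")
    print(f"allowed failures:   {budget.allowed_failures:.0f}")
    print(f"budget remaining:   {budget.remaining_fraction:.0%}")
```

A negative `remaining_fraction` signals that the window has spent more than its budget. What a team does in that case (for example, slowing feature releases) is a policy decision that this sketch does not model.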
References
- Book: "Site Reliability Engineering: How Google Runs Production Systems" (O'Reilly, 2016)
- Workbook: "The Site Reliability Workbook" (O'Reilly, 2018)
- Online: https://sre.google/
Core Philosophy
"Hope is not a strategy."
"SRE is what happens when you ask a software engineer to design an operations team."
"Reliability is the most important feature."