Platform Engineering
Consolidates Observability and Performance Engineering practices for building high-performance, resilient, and observable systems.
When to Use This Skill
- Observability: Setting up Prometheus/Grafana, OpenTelemetry, logging pipelines.
- Performance: Debugging latency, optimizing DB queries, caching strategies.
- Reliability: Defining SLIs/SLOs, Load Testing, Chaos Engineering.
- Infrastructure: Incident response and post-mortems.
Core Disciplines
1. Observability (The "Three Pillars")
Logs (Events)
- Purpose: Debugging specific events. "What happened?"
- Best Practice: Emit structured JSON logs. Include only application context and leave out noise.
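The structured-logging practice above can be sketched with Python's standard `logging` module. This is a minimal illustration, not a prescribed setup: the `JsonFormatter` class, the `build_logger` helper, the `context` key passed through `extra`, and the field names (`ts`, `order_id`, `reason`) are all assumptions made for the example.

```python
import json
import logging


class JsonFormatter(logging.Formatter):
    """Render each log record as a single JSON object (hypothetical helper)."""

    def format(self, record):
        payload = {
            "ts": self.formatTime(record),
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
        }
        # Merge application context passed as extra={"context": {...}}.
        # The "context" key is a convention assumed for this sketch.
        payload.update(getattr(record, "context", {}))
        return json.dumps(payload)


def build_logger(name, stream=None):
    """Return a logger that writes JSON lines to `stream` (stderr if None)."""
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger = logging.getLogger(name)
    logger.handlers = [handler]   # replace any handlers from earlier calls
    logger.setLevel(logging.INFO)
    logger.propagate = False      # avoid duplicate output via the root logger
    return logger


logger = build_logger("checkout")
logger.info(
    "payment failed",
    extra={"context": {"order_id": "A-1001", "reason": "card_declined"}},
)
```

Each call emits one JSON object per line, so a log pipeline can filter on fields such as `order_id` without parsing free-form text. Keeping the context dictionary small and application-specific is one way to follow the "no noise" guidance.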