qa-observability
QA Observability and Performance Engineering
Use telemetry (logs, metrics, traces, profiles) as a QA signal and a debugging substrate.
Core references (see data/sources.json): OpenTelemetry, W3C Trace Context, and SLO practices (Google SRE).
Quick Start (Default)
If key context is missing, ask for: critical user journeys, service/dependency inventory, environments (local/staging/prod), current telemetry stack, and current SLO/SLA commitments (if any).
- Establish the minimum bar: correlation IDs + structured logs + traces + golden metrics (latency, traffic, errors, saturation).
- Verify propagation: confirm `traceparent` (and your request ID) flow across boundaries end-to-end.
- Make failures diagnosable: every test failure captures a trace link (or trace ID) plus the correlated logs.
- Define SLIs/SLOs and error budget policy; wire burn-rate alerts (prefer multi-window burn rates).
- Produce artifacts: a readiness checklist plus an SLO definition and alert rules (use `assets/checklists/template-observability-readiness-checklist.md` and `assets/monitoring/slo/*`).
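The propagation check in the list above can be sketched as a small test helper. This is a minimal illustration, not part of this skill's assets: it assumes the four-field `traceparent` layout of W3C Trace Context version 00, and the names `parse_traceparent` and `trace_propagated` are hypothetical. It treats a hop as correctly propagated when the inbound and downstream headers are both well-formed and share the same trace ID.

```python
import re
from typing import NamedTuple, Optional

# Four lowercase-hex fields: version, trace-id, parent-id, trace-flags.
_TRACEPARENT_RE = re.compile(
    r"([0-9a-f]{2})-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})"
)


class TraceParent(NamedTuple):
    version: str
    trace_id: str
    parent_id: str
    flags: str


def parse_traceparent(header: str) -> Optional[TraceParent]:
    """Return the parsed header, or None if it is malformed."""
    match = _TRACEPARENT_RE.fullmatch(header.strip())
    if match is None:
        return None
    parsed = TraceParent(*match.groups())
    # The spec forbids version "ff" and all-zero trace or parent IDs.
    if parsed.version == "ff":
        return None
    if set(parsed.trace_id) == {"0"} or set(parsed.parent_id) == {"0"}:
        return None
    return parsed


def trace_propagated(inbound: str, downstream: str) -> bool:
    """True when both headers are valid and carry the same trace ID."""
    first = parse_traceparent(inbound)
    second = parse_traceparent(downstream)
    return first is not None and second is not None and first.trace_id == second.trace_id
```

In an end-to-end test, one option is to record the header received at the edge and the header seen by the last service in the journey, then assert `trace_propagated(edge_header, last_hop_header)`. The parent ID is expected to differ between hops, so only the trace ID is compared.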
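To make the multi-window burn-rate idea from the list above concrete, here is a rough sketch. Burn rate is the observed error ratio divided by the error budget (1 minus the SLO target), and a multi-window alert fires only when both a long and a short window exceed the threshold. The function names are hypothetical, and the default threshold of 14.4 (paired with 1-hour and 5-minute windows for a 30-day SLO) is an illustrative value from Google's SRE guidance, not a setting defined by this skill.

```python
from typing import Tuple

# (error_count, total_count) observed in one window.
Window = Tuple[int, int]


def burn_rate(errors: int, total: int, slo_target: float) -> float:
    """Error ratio divided by the error budget (1 - slo_target)."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total == 0:
        return 0.0  # no traffic means no budget was consumed
    return (errors / total) / (1.0 - slo_target)


def should_page(
    long_window: Window,
    short_window: Window,
    slo_target: float,
    threshold: float = 14.4,  # assumed example value, tune per SLO
) -> bool:
    """Fire only when both windows burn budget faster than the threshold."""
    long_burn = burn_rate(long_window[0], long_window[1], slo_target)
    short_burn = burn_rate(short_window[0], short_window[1], slo_target)
    return long_burn >= threshold and short_burn >= threshold
```

Requiring the short window as well means the alert stops firing soon after the error rate recovers, which is the usual argument for preferring multi-window rules over a single long window.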