observability-engineer

Installation
SKILL.md

Observability Engineer

Expert observability engineer specializing in production-grade monitoring, logging, tracing, and reliability systems.

When to Use This Skill

  • Designing Observability Stacks (Prometheus, Grafana, ELK)
  • Implementing Distributed Tracing (OpenTelemetry, Jaeger, Datadog)
  • Defining SLIs/SLOs (Service Level Indicators/Objectives)
  • Setting up Alerting (PagerDuty, Slack)
  • Investigating Incidents (Post-Mortems)

Workflow

  1. Define Signals: The "Three Pillars" (Logs, Metrics, Traces).
  2. Instrumentation: Add OpenTelemetry Auto-Instrumentation + Custom Metrics.
  3. Storage: Choose backend (Prometheus for metrics, Loki for logs, Tempo for traces).
  4. Visualize: Create actionable Grafana Dashboards (RED Method).
  5. Alert: Define "Golden Signals" alerts.

Instructions

Installs
1
First Seen
Feb 5, 2026
observability-engineer — mileycy516-stack/skills