monitoring-expert

Summary

Implements monitoring, logging, metrics, tracing, and performance testing for production systems.

  • Covers structured logging (Pino/JSON), Prometheus metrics (counters, histograms, gauges), and OpenTelemetry distributed tracing with span instrumentation
  • Includes Prometheus alerting rule configuration, RED/USE dashboard design patterns, and health check endpoint setup
  • Provides load testing with k6 and Artillery, application profiling for CPU/memory bottlenecks, and capacity planning guidance
  • Enforces best practices: correlation IDs for request tracking, no sensitive data in logs, and alerts limited to critical paths to prevent alert fatigue
SKILL.md

Monitoring Expert

Observability and performance specialist implementing comprehensive monitoring, alerting, tracing, and performance testing systems.

Core Workflow

  1. Assess — Identify what needs monitoring (SLIs, critical paths, business metrics)
  2. Instrument — Add logging, metrics, and traces to the application (see examples below)
  3. Collect — Configure aggregation and storage (Prometheus scrape, log shipper, OTLP endpoint); verify data arrives before proceeding
  4. Visualize — Build dashboards using RED (Rate/Errors/Duration) or USE (Utilization/Saturation/Errors) methods
  5. Alert — Define threshold and anomaly alerts on critical paths; validate no false-positive flood before shipping
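The Instrument and Visualize steps can be sketched with a small in-memory recorder for the RED signals: a request counter labelled by status (Rate, Errors) and a latency histogram (Duration), rendered in the Prometheus text exposition format. This is a dependency-free illustration, not the skill's own implementation; the function names, metric names, and bucket bounds are assumptions, and a real service would more likely use a Prometheus client library.

```javascript
// Minimal in-memory RED metrics: a request counter plus a latency histogram.
// recordRequest, renderMetrics, BUCKETS and the metric names are hypothetical.
const BUCKETS = [0.1, 0.5, 1, 5]; // bucket upper bounds in seconds (assumed values)

const requestCounts = new Map(); // "method|status" -> count
const bucketCounts = new Array(BUCKETS.length).fill(0); // per-bucket, non-cumulative
let durationSum = 0;
let durationCount = 0;

function recordRequest(method, status, seconds) {
  const key = `${method}|${status}`;
  requestCounts.set(key, (requestCounts.get(key) ?? 0) + 1);
  durationSum += seconds;
  durationCount += 1;
  // Find the first bucket whose upper bound covers this observation.
  const i = BUCKETS.findIndex((le) => seconds <= le);
  if (i !== -1) bucketCounts[i] += 1; // slower requests count only toward +Inf
}

function renderMetrics() {
  const lines = ['# TYPE http_requests_total counter'];
  for (const [key, count] of requestCounts) {
    const [method, status] = key.split('|');
    lines.push(`http_requests_total{method="${method}",status="${status}"} ${count}`);
  }
  lines.push('# TYPE http_request_duration_seconds histogram');
  // Prometheus histogram buckets are cumulative, so keep a running total.
  let cumulative = 0;
  BUCKETS.forEach((le, i) => {
    cumulative += bucketCounts[i];
    lines.push(`http_request_duration_seconds_bucket{le="${le}"} ${cumulative}`);
  });
  lines.push(`http_request_duration_seconds_bucket{le="+Inf"} ${durationCount}`);
  lines.push(`http_request_duration_seconds_sum ${durationSum}`);
  lines.push(`http_request_duration_seconds_count ${durationCount}`);
  return lines.join('\n') + '\n';
}
```

Calling `recordRequest('GET', 200, 0.05)` after each handled request and serving `renderMetrics()` from a scrape endpoint would give a dashboard the three RED inputs: request rate from the counter, error rate from the `status` label, and latency quantiles from the histogram buckets.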

Quick-Start Examples

Structured Logging (Node.js / Pino)

import pino from 'pino';

// Create a logger that emits JSON lines at 'info' level and above.
const logger = pino({ level: 'info' });
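Two of the practices listed in the summary, correlation IDs and keeping sensitive data out of logs, can be illustrated without any dependencies. The sketch below is an assumption-laden stand-in rather than Pino itself: `createLogger`, `REDACTED_KEYS`, and the field names are hypothetical, and redaction here only covers top-level keys. Pino offers comparable building blocks through `logger.child()` and its `redact` option.

```javascript
// Dependency-free sketch: JSON log lines with a correlation ID and redaction.
// createLogger, REDACTED_KEYS and the field names are hypothetical.
const REDACTED_KEYS = new Set(['password', 'token', 'authorization']);

// Replace the values of sensitive top-level keys (shallow, not recursive).
function redact(fields) {
  const safe = {};
  for (const [key, value] of Object.entries(fields)) {
    safe[key] = REDACTED_KEYS.has(key.toLowerCase()) ? '[REDACTED]' : value;
  }
  return safe;
}

function createLogger(bindings = {}, write = (line) => console.log(line)) {
  const log = (level) => (fields, msg) =>
    write(JSON.stringify({ level, time: Date.now(), ...bindings, ...redact(fields), msg }));
  return {
    info: log('info'),
    error: log('error'),
    // Returns a logger that stamps every line with the extra bindings.
    child: (extra) => createLogger({ ...bindings, ...extra }, write),
  };
}

// Usage: one child logger per request, keyed by its correlation ID.
const captured = [];
const requestLogger = createLogger({}, (line) => captured.push(line))
  .child({ correlationId: 'req-123' });
requestLogger.info({ userId: 42, password: 'hunter2' }, 'login attempt');
```

Creating the child logger once per incoming request means every line written while handling that request carries the same `correlationId`, so the lines can be joined later in the log store.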

More from jeffallan/claude-skills

First Seen
Jan 20, 2026