observability-design
Observability Design
Design comprehensive observability across metrics, logs, and traces to understand system behavior and debug issues.
Context
You are building observability for a system. The user struggles to debug production issues or wants better visibility. Start by reading their current monitoring setup.
Domain Context
Based on Google's SRE practices and observability research:
- Metrics: Quantitative measurements (latency, error rate, QPS). Fast queries, low cardinality.
- Logs: Event records with context (error messages, user actions). High volume, searchable.
- Traces: Request flow across services. Shows dependencies and latency breakdown.
- Correlation IDs: Link logs and traces across services. Essential for debugging.
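- Cardinality: The number of unique label-value combinations for a metric. High cardinality (millions of unique values, such as a per-user ID label) multiplies the number of time series and can overwhelm metric storage.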
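Two of the concepts above, correlation IDs and cardinality, can be illustrated with a minimal Python sketch using only the standard library. Everything named here (`correlation_id`, `JsonFormatter`, `build_logger`, `handle_request`, `worst_case_series`) is a hypothetical helper invented for illustration, not part of any particular observability stack. The sketch assumes logs are emitted as one JSON object per line and that the worst-case series count for a metric is the product of the unique values of each label.

```python
import contextvars
import io
import json
import logging
import uuid
from math import prod

# Hypothetical context variable holding the current request's correlation ID.
correlation_id = contextvars.ContextVar("correlation_id", default="-")


class JsonFormatter(logging.Formatter):
    """Render each record as one JSON object that carries the correlation ID."""

    def format(self, record):
        return json.dumps({
            "level": record.levelname,
            "logger": record.name,
            "message": record.getMessage(),
            "correlation_id": correlation_id.get(),
        })


def build_logger(name, stream):
    """Create a logger that writes JSON lines to the given stream."""
    logger = logging.getLogger(name)
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.setLevel(logging.INFO)
    logger.propagate = False
    return logger


def handle_request(logger, incoming_id=None):
    """Reuse the caller's correlation ID if one was sent, otherwise mint one."""
    cid = incoming_id or uuid.uuid4().hex
    token = correlation_id.set(cid)
    try:
        logger.info("request started")
        logger.info("request finished")
    finally:
        # Restore the previous value so IDs do not leak between requests.
        correlation_id.reset(token)
    return cid


def worst_case_series(unique_values_per_label):
    """Upper bound on time series for one metric: product of unique values per label."""
    return prod(unique_values_per_label.values())
```

Calling `handle_request(build_logger("checkout", io.StringIO()), "abc123")` would tag both log lines with `abc123`, so they can be joined with logs from other services that received the same ID. For cardinality, `worst_case_series({"method": 5, "status": 10, "endpoint": 50})` gives 2500 series, while swapping `endpoint` for a `user_id` label with a million values gives 50,000,000, which is why identifiers of that kind usually belong in logs or traces rather than in metric labels.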
Instructions