observability-k8s-investigation
Installation
SKILL.md
Kubernetes Investigation
Diagnose Kubernetes issues using OTel telemetry collected via EDOT (Elastic Distribution of OpenTelemetry) and the kube-stack collector. Correlate cluster state, pod runtime metrics, K8s events, application logs, and APM to identify root cause across the workload, node, and control-plane layers.
Scope
In scope: OTel-receiver-namespaced indices (metrics-kubeletstatsreceiver.otel-*,
metrics-k8sclusterreceiver.otel-*, logs-k8seventsreceiver.otel-*, logs-k8sobjectsreceiver.otel-*) and OTel
semantic conventions (k8s.pod.name, k8s.namespace.name, k8s.container.restarts).
Out of scope:
- The legacy Elastic Agent Kubernetes integration (
metrics-kubernetes.*,logs-kubernetes.*,kubernetes.*fields). Being deprecated — do not author queries against these paths. - APM-layer analysis (service SLO breaches, transaction error rates, upstream dependency health). Different domain — once a K8s root cause is ruled in or out, APM investigation continues outside this skill.
- Cluster provisioning, capacity planning, cost optimization. Different domain.