kubespray-troubleshooting
Installation
SKILL.md
Kubespray Troubleshooting
Overview
Diagnose and fix common Kubespray deployment failures. Most failures stem from network misconfiguration, etcd issues, or stale state from previous attempts.
Core principle: Read the exact task name that failed, check logs on that specific node, then fix and re-run (Ansible is idempotent).
When to Use
- Deployment fails mid-playbook
kubeadm joinerrors- etcd health check timeouts
- Nodes stuck in NotReady state
- Certificate-related failures
Not for: Initial deployment setup (use kubespray-deployment), upgrades (use kubespray-operations), certificate renewal (use kubespray-certificates)