debug-buttercup

Installation
Summary

Diagnose and resolve Buttercup CRS pod crashes, cascading failures, and service misbehavior on Kubernetes.

  • Covers 20+ services across fuzzing, analysis, orchestration, and infrastructure layers; includes triage workflow, log analysis, resource pressure diagnosis, and Redis queue inspection
  • Provides cascade-failure detection (e.g., Redis down triggering mass restarts), health-check file monitoring, and per-service failure patterns
  • Supports OpenTelemetry/Signoz distributed tracing, volume and storage verification, and Helm configuration validation against actual pod state
  • Includes automated diagnostic script and detailed queue inspection commands for Redis streams and consumer groups
SKILL.md

Debug Buttercup

When to Use

  • Pods in the crs namespace are in CrashLoopBackOff, OOMKilled, or restarting
  • Multiple services restart simultaneously (cascade failure)
  • Redis is unresponsive or showing AOF warnings
  • Queues are growing but tasks are not progressing
  • Nodes show DiskPressure, MemoryPressure, or PID pressure
  • Build-bot cannot reach the Docker daemon (DinD failures)
  • Scheduler is stuck and not advancing task state
  • Health check probes are failing unexpectedly
  • Deployed Helm values don't match actual pod configuration

When NOT to Use

  • Deploying or upgrading Buttercup (use Helm and deployment guides)
  • Debugging issues outside the crs Kubernetes namespace
  • Performance tuning that doesn't involve a failure symptom
Related skills
Installs
1.8K
GitHub Stars
5.1K
First Seen
Feb 11, 2026