gke-workload-scaling
Installation
SKILL.md
GKE Workload Scaling
This skill provides workflows and best practices for scaling applications on Google Kubernetes Engine (GKE). It covers manual scaling, Horizontal Pod Autoscaling (HPA), and Vertical Pod Autoscaling (VPA).
Workflows
1. Manual Scaling
Quickly scale a deployment to a fixed number of replicas. Useful for immediate manual intervention or testing.
Command:
kubectl scale deployment <deployment-name> --replicas=<number> -n <namespace>
2. Horizontal Pod Autoscaling (HPA)
Automatically scale the number of pods based on observed CPU utilization, memory utilization, or custom metrics.