Deploy SkyPilot TPU Cluster on GKE

This skill deploys a SkyPilot-managed TPU cluster on an existing GKE cluster. It builds on the apply-resource skill, which handles GKE cluster creation via xpk.

Key Feature: Each TPU type gets its own SkyPilot cluster (named <cluster>-<username>-<tpu_type>), so multiple topologies can run in parallel on the same GKE cluster. Node pools are managed automatically per TPU type.
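
As an illustration of the naming pattern above, here is a minimal sketch that composes a per-TPU-type cluster name. The function name sky_cluster_name and the sample values (demo, alice, v5e-8, v6e-16) are hypothetical and not taken from the skill; the skill itself may normalize or validate these parts differently.

```python
def sky_cluster_name(cluster: str, username: str, tpu_type: str) -> str:
    """Join the parts following the <cluster>-<username>-<tpu_type> pattern.

    Hypothetical helper: the real skill may apply extra normalization.
    """
    parts = [cluster, username, tpu_type]
    if not all(parts):
        raise ValueError("cluster, username, and tpu_type must all be non-empty")
    return "-".join(parts)


# Hypothetical example: one GKE cluster, one user, two TPU types.
# Each TPU type maps to its own SkyPilot cluster name.
for tpu_type in ("v5e-8", "v6e-16"):
    print(sky_cluster_name("demo", "alice", tpu_type))
```

Because the TPU type is part of the name, two topologies requested by the same user on the same GKE cluster resolve to two distinct SkyPilot clusters.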

Prerequisites

  • SkyPilot: pip install skypilot
    • Check: sky --help
  • Google Cloud SDK (gcloud): Install guide
    • Run gcloud auth login to authenticate
  • kubectl: Install guide
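
The prerequisites above can be checked before deploying with a short script. This is a sketch, not part of the skill: it assumes the three tools are exposed on PATH as sky, gcloud, and kubectl, and the helper name missing_tools is hypothetical. It only confirms that the executables exist; it does not verify that gcloud auth login has been run.

```python
import shutil

# Executable names assumed from the prerequisites list.
REQUIRED_TOOLS = ("sky", "gcloud", "kubectl")


def missing_tools(tools=REQUIRED_TOOLS):
    """Return the tools from `tools` that are not found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing prerequisites: " + ", ".join(missing))
    else:
        print("All prerequisite CLIs found on PATH.")
```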

Defaults

The following defaults apply unless the user explicitly overrides them:
