databricks-performance-tuning
Installation
SKILL.md
Databricks Performance Tuning
Overview
Optimize Databricks cluster, Spark, and Delta Lake performance.
Prerequisites
- Access to cluster configuration
- Understanding of workload characteristics
- Query history access
Instructions
Step 1: Cluster Sizing
# Cluster sizing calculator
def recommend_cluster_size(
data_size_gb: float,
complexity: str = "medium", # low, medium, high
parallelism_need: str = "standard", # standard, high
Related skills