spark-connect

Spark Connect — Local Execution Against a Databricks Cluster

Databricks Connect (built on Apache Spark Connect) lets you write and launch PySpark code on your local machine while the computation itself executes on a remote Databricks cluster.

Benefits:

  • Local IDE/Jupyter development with full Spark semantics
  • No need to upload notebooks manually: scripts run in place from your machine
  • Enables autonomous loops (autoresearch, CI) that submit Spark jobs per iteration

How It Works

Local Machine (your IDE / script)
        │  Spark Connect protocol (gRPC)
Databricks Cluster (<DATABRICKS_CLUSTER_ID>)
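
The flow above can be sketched in Python as follows. This is a minimal sketch, not the skill's own setup code: it assumes the `databricks-connect` package is installed and that the workspace URL, a personal access token, and the cluster ID are supplied through the environment variables `DATABRICKS_HOST`, `DATABRICKS_TOKEN`, and `DATABRICKS_CLUSTER_ID`. The helper names `connection_settings` and `get_session` are hypothetical and exist only for illustration.

```python
import os
from typing import Mapping

# Assumed configuration source for this sketch: three environment variables.
REQUIRED = ("DATABRICKS_HOST", "DATABRICKS_TOKEN", "DATABRICKS_CLUSTER_ID")


def connection_settings(env: Mapping[str, str] = os.environ) -> dict:
    """Collect the values needed to reach the cluster, failing early if any is missing."""
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise KeyError(f"missing environment variables: {', '.join(missing)}")
    return {
        "host": env["DATABRICKS_HOST"],
        "token": env["DATABRICKS_TOKEN"],
        "cluster_id": env["DATABRICKS_CLUSTER_ID"],
    }


def get_session(env: Mapping[str, str] = os.environ):
    """Open a Spark Connect session against the remote cluster (hypothetical helper)."""
    # Imported lazily so this module still loads where databricks-connect is absent.
    from databricks.connect import DatabricksSession

    return DatabricksSession.builder.remote(**connection_settings(env)).getOrCreate()
```

With those variables set, `spark = get_session()` followed by something like `spark.range(5).show()` would build the query plan locally, send it over gRPC, and print rows computed on the cluster. Checking the settings before opening the session keeps a misconfigured machine from failing halfway through a longer script.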
Repository
bmsuisse/skills
First Seen
Apr 8, 2026