spark-version-upgrade
Installation
SKILL.md
Upgrade Apache Spark applications between major versions with a structured, phase-by-phase workflow.
When to Use
- Migrating from Spark 2.x → 3.x or Spark 3.x → 4.x
- Updating PySpark, Spark SQL, or Structured Streaming applications
- Resolving deprecation warnings before a Spark version bump
Workflow Overview
- Inventory & Impact Analysis — Scan the codebase and assess scope
- Build File Updates — Bump Spark/Scala/Java dependencies
- API Migration — Replace deprecated and removed APIs
- Configuration Migration — Update Spark config properties
- SQL & DataFrame Migration — Fix query-level breaking changes
- Test Validation — Compile, run tests, verify results