data-cleaning-pipeline-generator

Installation
SKILL.md

Data Cleaning Pipeline Generator

Generates comprehensive data cleaning and preprocessing pipelines using pandas, polars, or PySpark with best practices for handling messy data.

When to Use

  • "Clean my dataset"
  • "Generate data cleaning pipeline"
  • "Handle missing values"
  • "Remove duplicates"
  • "Fix data types"
  • "Detect and remove outliers"

Instructions

1. Analyze Dataset

import pandas as pd
Related skills
Installs
2
GitHub Stars
5
First Seen
Mar 10, 2026