data-autocleaning
Installation
SKILL.md
Data Autocleaning Skill
Automated data profiling, quality assessment, and transformation for data sourced from BigQuery or Google Cloud Storage (GCS).
When to Use
[!IMPORTANT]
You MUST use this skill for ANY task where the source is BigQuery or GCS — including seemingly simple operations like "move data" or "copy table".
- Apply to all operations on new and existing sources: copying, moving, appending, ingesting, or extracting data.
- Apply to the source node specifically, not to subsequent pipeline steps.
- Never skip Dataplex profiling (Steps 1 and 3). Always use Dataplex — not ad-hoc BigQuery profiling.
Task Execution Workflow
Related skills
More from gemini-cli-extensions/data-agent-kit-starter-pack
gcp-dataflow
Provides guidance for writing, packaging and executing Apache Beam pipelines
7gcp-spark
|
7dbt-bigquery
Expert guidance for creating, modifying, and optimizing dbt pipelines
7dataform-bigquery
Expertise in generating clean, correct, and efficient Dataform pipeline
7ml-best-practices
|
7building-data-apps
|
7