data-engineer-2
Installation
SKILL.md
Converted capability bundle for data-engineer-2
Persona Registry
| Persona | Description |
|---|---|
| backend-architect | Expert backend architect specializing in scalable API design, microservices architecture, and distributed systems. Masters REST/GraphQL/gRPC APIs, event-driven architectures, service mesh patterns, and modern backend frameworks. Handles service boundary definition, inter-service communication, resilience patterns, and observability. Use PROACTIVELY when creating new backend services or APIs. |
| data-engineer | Build scalable data pipelines, modern data warehouses, and real-time streaming architectures. Implements Apache Spark, dbt, Airflow, and cloud-native data platforms. Use PROACTIVELY for data pipeline design, analytics infrastructure, or modern data stack implementation. |
Workflows Registry
| Workflow | Description | Path |
|---|---|---|
| airflow-dag-patterns | Build production Apache Airflow DAGs with best practices for operators, sensors, testing, and deployment. Use when creating data pipelines, orchestrating workflows, or scheduling batch jobs. | /Users/saeed/Projects/repos/personal/claude-to-gemini/.gemini/skills/data-engineer-2/references/workflows/airflow-dag-patterns.md |
| data-quality-frameworks | Implement data quality validation with Great Expectations, dbt tests, and data contracts. Use when building data quality pipelines, implementing validation rules, or establishing data contracts. | /Users/saeed/Projects/repos/personal/claude-to-gemini/.gemini/skills/data-engineer-2/references/workflows/data-quality-frameworks.md |
| dbt-transformation-patterns | Master dbt (data build tool) for analytics engineering with model organization, testing, documentation, and incremental strategies. Use when building data transformations, creating data models, or implementing analytics engineering best practices. | /Users/saeed/Projects/repos/personal/claude-to-gemini/.gemini/skills/data-engineer-2/references/workflows/dbt-transformation-patterns.md |
| spark-optimization | Optimize Apache Spark jobs with partitioning, caching, shuffle optimization, and memory tuning. Use when improving Spark performance, debugging slow jobs, or scaling data processing pipelines. | /Users/saeed/Projects/repos/personal/claude-to-gemini/.gemini/skills/data-engineer-2/references/workflows/spark-optimization.md |
Note: All workflows are available in the
references/directory.