senior-data-engineer
Senior Data Engineer
Make data pipelines boring: predictable, observable, and recoverable.
Quick Start
- Define the data contract (schema + semantics + freshness + ownership).
- Design the pipeline: inputs, transformations, outputs, backfills, and failure handling.
- Data quality: checks for nulls, ranges, uniqueness, and referential integrity.
- Operational story: retries, checkpoints, alerting, and lineage.
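
As a minimal sketch of the data quality step above, the four checks (nulls, ranges, uniqueness, referential integrity) can be written in plain Python. Everything named here is hypothetical and chosen for illustration: the `orders` rows, the `order_id`, `customer_id`, and `amount` columns, the non-negative amount rule, and the `check_orders` and `QualityReport` helpers. A real pipeline would take these rules from its data contract.

```python
from dataclasses import dataclass, field


@dataclass
class QualityReport:
    """Collects human-readable failures; an empty list means all checks passed."""
    failures: list[str] = field(default_factory=list)

    @property
    def ok(self) -> bool:
        return not self.failures


def check_orders(orders: list[dict], customer_ids: set[int]) -> QualityReport:
    """Run the four checks on hypothetical `orders` rows.

    Assumed columns (illustrative only): order_id, customer_id, amount.
    """
    report = QualityReport()
    seen_ids: set[int] = set()
    required = ("order_id", "customer_id", "amount")

    for i, row in enumerate(orders):
        # Nulls: required fields must be present and not None.
        missing = [c for c in required if row.get(c) is None]
        if missing:
            report.failures.append(f"row {i}: null in {missing}")
            continue  # the remaining checks need these values

        # Range: amounts must be non-negative (an assumed business rule).
        if row["amount"] < 0:
            report.failures.append(f"row {i}: amount {row['amount']} out of range")

        # Uniqueness: order_id is treated as the primary key.
        if row["order_id"] in seen_ids:
            report.failures.append(f"row {i}: duplicate order_id {row['order_id']}")
        seen_ids.add(row["order_id"])

        # Referential integrity: customer_id must exist in the parent set.
        if row["customer_id"] not in customer_ids:
            report.failures.append(f"row {i}: unknown customer_id {row['customer_id']}")

    return report


if __name__ == "__main__":
    customers = {1, 2}
    orders = [
        {"order_id": 10, "customer_id": 1, "amount": 25.0},    # clean
        {"order_id": 10, "customer_id": 2, "amount": 5.0},     # duplicate key
        {"order_id": 11, "customer_id": 9, "amount": -3.0},    # bad range, bad reference
        {"order_id": 12, "customer_id": None, "amount": 1.0},  # null
    ]
    for failure in check_orders(orders, customers).failures:
        print(failure)
```

Returning a report instead of raising on the first failure lets the pipeline decide what to do with bad rows (fail the run, quarantine them, or only alert), which ties into the operational story.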
Optional tool: lightweight profiling for CSV/JSONL
python ~/.codex/skills/senior-data-engineer/scripts/data_quality_scan.py path/to/data.csv --out /tmp/data_profile.json
References
- Data contract template: references/data-contract.md
- Pipeline checklist: references/pipeline-checklist.md