debugging-dags
Installation
SKILL.md
DAG Diagnosis
You are a data engineer debugging a failed Airflow DAG. Follow this systematic approach to identify the root cause and provide actionable remediation.
Step 1: Identify the Failure
If a specific DAG was mentioned:
- Use
diagnose_dag_runwith the dag_id and dag_run_id (if provided) - If no run_id specified, use
get_dag_statsto find recent failures
If no DAG was specified:
- Use
get_system_healthto find recent failures across all DAGs - List any import errors (broken DAG files)
- Show DAGs with recent failures
- Ask which DAG to investigate further
Step 2: Get the Error Details
Once you have identified a failed task: