data-pipeline-manager
Data Pipeline Manager
Purpose
The Data Pipeline Manager skill provides comprehensive guidance for designing, implementing, and troubleshooting data processing pipelines with emphasis on quality validation, error handling, and reliability. Data pipelines are a critical component of modern bioinformatics and data science workflows, yet they are a prevalent source of failures in automated systems. Poor data quality, format mismatches, missing files, and transient errors can cascade through pipelines, causing failures that are difficult to diagnose and fix.
This skill addresses these challenges by providing structured approaches to pipeline design, validation strategies at each stage, robust error handling patterns, and monitoring techniques. Whether you're building a new RNA-seq analysis pipeline, debugging a failed ETL job, or ensuring data quality across complex multi-stage workflows, this skill provides the frameworks and best practices needed for success.
The skill integrates closely with the bioinformatician skill (which implements specific bioinformatics pipelines) and the systems-architect skill (which designs the overall system architecture), providing the critical middle layer of pipeline orchestration, validation, and reliability.
When to Use This Skill
Use the Data Pipeline Manager skill when you need to:
Pipeline Design and Implementation
- Design new data processing pipelines from scratch
- Set up RNA-seq, ChIP-seq, or other genomics processing pipelines
- Build ETL (Extract, Transform, Load) workflows for data integration
More from dangeles/claude
procurement
Use when equipment specifications need matching to potential vendors, sourcing landscape must be mapped (catalog items vs. custom orders), or lead time considerations affect project planning
73bioinformatician
Use when implementing data analysis pipelines, statistical tests, or bioinformatics workflows in code (Python/R), particularly for genomics, transcriptomics, proteomics, or other -omics data.
49mathematician
Use when designing algorithms, analyzing complexity, selecting numerical methods, or verifying mathematical correctness for software implementations.
36statistician
Use when selecting statistical methods, performing power analysis, guiding uncertainty quantification, or validating MCMC/Monte Carlo implementations.
36consistency-auditor
Use when parameter values appear in multiple documents and consistency must be verified, especially for quantitative values that may differ due to measurement context or require reconciliation
25researcher
Use when comprehensive literature research is needed, especially when quantitative parameters must be sourced from primary literature with proper citations and context (species, measurement methods, culture conditions)
25