data-analysis
SKILL.md
Data Analysis
Expert guidance for data analysis and data science workflows. Provides tool selection, workflow patterns, and curated resources across the full data stack.
Tool Selection Quick Reference
Data Manipulation
| Need | Recommended Tool | Alternative |
|---|---|---|
| General tabular data | Pandas | Polars (faster, multithreaded) |
| Large datasets (out-of-core) | Polars, Dask | Vaex, Modin |
| GPU-accelerated DataFrames | cuDF (RAPIDS) | CuPy (NumPy on GPU) |
| Parallel pandas operations | Pandarallel | Modin |
| Cross-engine portability | Fugue (Pandas/Spark/Dask) | - |
| Fuzzy string matching | TheFuzz | - |
| Date/time handling | Pendulum, Arrow | DateUtil |