data-science-visualization
Data Visualization
Use this skill for creating effective visualizations: choosing the right library, chart type, and interactivity level for your data and audience.
When to use this skill
- Choosing a visualization library for a project
- Creating exploratory charts during EDA
- Building interactive dashboards
- Producing publication-quality figures
- Understanding tradeoffs between libraries
Library selection guide (2026)
| Library | Best For | Interactivity | Learning Curve |
|---|---|---|---|
| Matplotlib | Publication-quality static plots, fine control | Static | Moderate |
| Seaborn | Statistical visualization, quick EDA | Static | Easy |
| Plotly | Interactive web charts, dashboards | High | Easy |
More from legout/data-platform-agent-skills
data-science-eda
Exploratory Data Analysis (EDA): profiling, visualization, correlation analysis, and data quality checks. Use when understanding dataset structure, distributions, relationships, or preparing for feature engineering and modeling.
13data-engineering-core
Core Python data engineering: Polars, DuckDB, PyArrow, PostgreSQL, ETL patterns, performance tuning, and resilient pipeline construction. Use when building or reviewing batch ETL/dataframe/SQL pipelines in Python.
10data-science-feature-engineering
Feature engineering for machine learning: encoding, scaling, transformations, datetime features, text features, and feature selection. Use when preparing data for modeling or improving model performance through better representations.
10data-science-notebooks
Interactive notebooks for data science: Jupyter, JupyterLab, and marimo. Use for exploratory analysis, reproducible research, documentation, and sharing insights with stakeholders.
9data-engineering-best-practices
Data engineering best practices: medallion architecture, dataset lifecycle, partitioning, file sizing, schema evolution, and append/overwrite/merge patterns across Polars, PyArrow, DuckDB, Delta Lake, and Iceberg. Use when designing production data pipelines or reviewing data platform decisions.
8data-engineering-storage-formats
Modern data serialization formats: Parquet, Apache Arrow (Feather/IPC), Lance (ML-native), Zarr (chunked arrays), Avro, and ORC. Covers compression, partitioning, and format selection.
8