harvard-art-museums-etl-pipeline
Installation
SKILL.md
Harvard Art Museums ETL Pipeline
Skill by ara.so — Data Skills collection.
This skill enables you to build complete data engineering and analytics applications using the Harvard Art Museums API. The project demonstrates real-world ETL pipelines, SQL database design, analytical queries, and interactive Streamlit dashboards for museum artifact data.
What This Project Does
The Harvard Artifacts Collection Data Engineering & Analytics App provides:
- API Integration: Collect artifact data from Harvard Art Museums API with pagination and rate limiting
- ETL Pipeline: Extract, transform, and load museum data into relational SQL databases
- Database Design: Multi-table schema with artifact metadata, media, and color information
- SQL Analytics: 20+ predefined analytical queries for data insights
- Interactive Visualization: Streamlit dashboard with Plotly charts for real-time analytics
The architecture follows: API → ETL → SQL → Analytics → Visualization