Harvard Artifacts Collection ETL Analytics
Skill by ara.so — Data Skills collection
This skill enables you to build end-to-end data engineering and analytics applications using the Harvard Art Museums API. It demonstrates real-world ETL pipelines, SQL database design, analytical queries, and interactive data visualization using Streamlit.
What It Does
The Harvard Artifacts Collection ETL Analytics project provides:
- API Integration: Fetch artifact data from Harvard Art Museums API with pagination and rate limiting
- ETL Pipeline: Extract, transform, and load nested JSON data into relational SQL tables
- Database Design: Structured schema with artifactmetadata, artifactmedia, and artifactcolors tables
- SQL Analytics: 20+ predefined analytical queries for insights
- Interactive Dashboards: Streamlit-based UI with Plotly visualizations
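The transform and load steps listed above can be sketched roughly as follows. This is a minimal illustration, not the project's actual implementation: it assumes SQLite as the database, and the sample record, its field names, the column layout, and the helper names `transform` and `load` are all hypothetical. Only the three table names come from the description above.

```python
import sqlite3

# Hypothetical sample shaped like one nested artifact record.
# The field names are assumptions made for illustration only.
SAMPLE_RECORD = {
    "id": 1001,
    "title": "Bowl",
    "culture": "Chinese",
    "century": "18th century",
    "classification": "Vessels",
    "imagecount": 2,
    "mediacount": 0,
    "colors": [
        {"color": "#c8c8c8", "hue": "Grey", "percent": 0.6},
        {"color": "#323232", "hue": "Black", "percent": 0.4},
    ],
}

# Assumed column layout; only the table names are taken from the text.
SCHEMA = """
CREATE TABLE IF NOT EXISTS artifactmetadata (
    id INTEGER PRIMARY KEY, title TEXT, culture TEXT,
    century TEXT, classification TEXT
);
CREATE TABLE IF NOT EXISTS artifactmedia (
    objectid INTEGER REFERENCES artifactmetadata(id),
    imagecount INTEGER, mediacount INTEGER
);
CREATE TABLE IF NOT EXISTS artifactcolors (
    objectid INTEGER REFERENCES artifactmetadata(id),
    color TEXT, hue TEXT, percent REAL
);
"""


def transform(record):
    """Split one nested record into rows for the three tables."""
    object_id = record.get("id")
    metadata = (object_id, record.get("title"), record.get("culture"),
                record.get("century"), record.get("classification"))
    media = (object_id, record.get("imagecount"), record.get("mediacount"))
    # A missing or null color list yields no color rows.
    colors = [(object_id, c.get("color"), c.get("hue"), c.get("percent"))
              for c in record.get("colors") or []]
    return metadata, media, colors


def load(conn, records):
    """Create the tables if needed and insert the transformed rows."""
    conn.executescript(SCHEMA)
    for record in records:
        metadata, media, colors = transform(record)
        conn.execute(
            "INSERT OR REPLACE INTO artifactmetadata VALUES (?, ?, ?, ?, ?)",
            metadata)
        conn.execute("INSERT INTO artifactmedia VALUES (?, ?, ?)", media)
        conn.executemany(
            "INSERT INTO artifactcolors VALUES (?, ?, ?, ?)", colors)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, [SAMPLE_RECORD])

# One example analytical query: how many color entries fall under each hue.
rows = conn.execute(
    "SELECT hue, COUNT(*) FROM artifactcolors GROUP BY hue ORDER BY hue"
).fetchall()
print(rows)
```

Keeping the nested color list in its own table, keyed by the artifact id, is what lets a query like the one above aggregate across artifacts without re-parsing JSON.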