harvard-artifacts-etl-streamlit
Installation
SKILL.md
Harvard Artifacts Collection Data Engineering & Analytics App
Skill by ara.so — Data Skills collection.
This skill enables AI coding agents to help developers build and work with an end-to-end data engineering application that collects artifact data from the Harvard Art Museums API, performs ETL operations, stores data in SQL databases, and creates interactive analytics dashboards using Streamlit.
What This Project Does
The Harvard Artifacts Collection app demonstrates a complete data pipeline:
- Extract: Fetches artifact data from Harvard Art Museums API with pagination
- Transform: Processes nested JSON into relational database schema
- Load: Batch inserts into MySQL/TiDB Cloud with foreign key relationships
- Analyze: Runs 20+ predefined SQL analytical queries
- Visualize: Displays results in interactive Streamlit dashboards with Plotly charts
The architecture follows: API → ETL → SQL → Analytics → Visualization