harvard-art-museums-data-pipeline

Installation
SKILL.md

Harvard Art Museums Data Pipeline Skill

Skill by ara.so — Data Skills collection.

Overview

The Harvard Artifacts Collection Data Engineering & Analytics App is an end-to-end data pipeline that extracts artifact data from the Harvard Art Museums API, transforms it into relational tables, loads it into SQL databases (MySQL/TiDB), and provides interactive analytics through a Streamlit dashboard. The project demonstrates production-grade ETL patterns, SQL analytics, and data visualization.

Architecture Flow: API → ETL → SQL → Analytics → Visualization

Installation

# Clone the repository
git clone https://github.com/Manali0711/Harvard-Artifacts-Collection-Data-Engineering-Analytics-App.git
cd Harvard-Artifacts-Collection-Data-Engineering-Analytics-App

# Install dependencies
pip install -r requirements.txt
Installs
344
GitHub Stars
1
First Seen
May 23, 2026
Security Audits
harvard-art-museums-data-pipeline — aradotso/data-skills