harvard-art-museums-data-engineering-pipeline

Installation
SKILL.md

Harvard Art Museums Data Engineering Pipeline

Skill by ara.so — Data Skills collection.

Overview

This project demonstrates a production-grade data engineering workflow that:

  • Extracts artifact data from Harvard Art Museums API with pagination and rate limiting
  • Transforms nested JSON into normalized relational tables
  • Loads data into MySQL/TiDB Cloud databases
  • Executes analytical SQL queries for insights
  • Visualizes results through interactive Streamlit dashboards

The application showcases real-world ETL patterns, database design, and data visualization techniques used in analytics engineering roles.

Installation

Installs
225
GitHub Stars
1
First Seen
12 days ago
harvard-art-museums-data-engineering-pipeline — aradotso/data-skills