harvard-art-museums-etl-pipeline

Installation
SKILL.md

Harvard Art Museums ETL Pipeline

Skill by ara.so — Data Skills collection.

This skill enables you to build complete data engineering and analytics applications using the Harvard Art Museums API. The project demonstrates real-world ETL pipelines, SQL database design, analytical queries, and interactive Streamlit dashboards for museum artifact data.

What This Project Does

The Harvard Artifacts Collection Data Engineering & Analytics App provides:

  • API Integration: Collect artifact data from Harvard Art Museums API with pagination and rate limiting
  • ETL Pipeline: Extract, transform, and load museum data into relational SQL databases
  • Database Design: Multi-table schema with artifact metadata, media, and color information
  • SQL Analytics: 20+ predefined analytical queries for data insights
  • Interactive Visualization: Streamlit dashboard with Plotly charts for real-time analytics

The architecture follows: API → ETL → SQL → Analytics → Visualization

Installation

Installs
234
GitHub Stars
1
First Seen
12 days ago
harvard-art-museums-etl-pipeline — aradotso/data-skills