Skip to content

JKFLows/Spotify_data_pipeline

Repository files navigation

Spotify ETL Data Engineering Project

Intro

In this project, a ETL pipeline using Spotify API and AWS was build. The pipeline retrieves data from the Spotify API, transforms it to a desired format, and loads it into an AWS data store.

Architecture

Architecture Diagram

Dataset/API

Spotify API containing information among other things, about artists, albumbs and songs

Services Used

  1. Amazon S3 (Simple Storage Services)

  2. AWS Lamda

  3. Cloud Watch

  4. Glue Crawler

  5. Data Catalog

  6. Amazon Athena

Install Packages

pip install pandas
pip install spotipy

Project Execution Flow

Extrac Data from API -> Lamda Trigger (choosen frequency) -> Run extract Code -> Store the Raw Data -> Trigger the Transform Function -> Transform the Data and load it -> Query using Athena

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors