In this project, an ETL pipeline was built using the Spotify API and AWS. The pipeline retrieves data from the Spotify API, transforms it into the desired format, and loads it into an AWS data store.
- Spotify API: provides information about artists, albums, and songs, among other things
- Amazon S3 (Simple Storage Service)
- AWS Lambda
- Amazon CloudWatch
- AWS Glue Crawler
- AWS Glue Data Catalog
- Amazon Athena
pip install pandas
pip install spotipy
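Once spotipy is installed, the extract step might look roughly like the sketch below. It assumes the data comes from a public playlist and that the client credentials flow is used; the helper names `build_client` and `fetch_playlist_items` are hypothetical and not taken from this project.

```python
def build_client(client_id, client_secret):
    """Create a Spotify client using the client credentials flow."""
    # Imported here so the module can be loaded without spotipy installed.
    import spotipy
    from spotipy.oauth2 import SpotifyClientCredentials

    auth = SpotifyClientCredentials(client_id=client_id, client_secret=client_secret)
    return spotipy.Spotify(auth_manager=auth)


def fetch_playlist_items(sp, playlist_id):
    """Return all items of a playlist, following pagination links."""
    results = sp.playlist_tracks(playlist_id)
    items = list(results["items"])
    # Each page carries a "next" URL until the last page is reached.
    while results.get("next"):
        results = sp.next(results)
        items.extend(results["items"])
    return items
```

In a Lambda function, the client ID and secret would typically be read from environment variables rather than hard-coded.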
Project Execution Flow
Extract Data from API -> Lambda Trigger (chosen frequency) -> Run Extract Code -> Store the Raw Data -> Trigger the Transform Function -> Transform the Data and Load It -> Query Using Athena
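
The "store the raw data" and "transform" steps of this flow could be sketched as follows. This is a minimal illustration, not the project's actual code: the function names, the S3 key layout, and the selected columns are all assumptions, and `s3_client` is expected to be a boto3 S3 client (for example `boto3.client("s3")`).

```python
import json
from datetime import datetime, timezone


def store_raw(s3_client, bucket, items, now=None):
    """Write the raw API items to S3 as JSON and return the object key."""
    now = now or datetime.now(timezone.utc)
    # Hypothetical key layout: one timestamped file per extract run.
    key = f"raw_data/to_process/spotify_raw_{now:%Y%m%dT%H%M%S}.json"
    body = json.dumps(items).encode("utf-8")
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)
    return key


def transform_songs(items):
    """Flatten raw playlist items into one flat row per song."""
    rows = []
    for item in items:
        track = item.get("track")
        if not track:  # unavailable tracks can come back as null
            continue
        rows.append({
            "song_id": track["id"],
            "song_name": track["name"],
            "duration_ms": track["duration_ms"],
            "popularity": track["popularity"],
            "album_id": track["album"]["id"],
            "artist_id": track["artists"][0]["id"],  # first listed artist only
            "added_at": item["added_at"],
        })
    return rows
```

The resulting rows can be passed to `pandas.DataFrame(rows)` and written back to S3 as CSV, which a Glue crawler can then catalog so the data is queryable from Athena.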
