Spotify Songs Analysis

Overview

This project explores the characteristics of songs available on Spotify using two datasets obtained from Kaggle. It combines Exploratory Data Analysis (EDA), which uncovers insights about song features and their relationships, with NLP analysis of song lyrics. It also builds a Hugging Face Transformers pipeline for emotion analysis.

Data Sources

The data for this project was obtained from the following sources:

Spotify Songs Dataset - JoeBeachCapital
- This dataset contains information about various features of songs available on Spotify, such as danceability, energy, key, loudness, mode, speechiness, acousticness, instrumentalness, liveness, valence, tempo, and time signature.
- Dataset Description: The Spotify Songs Dataset provides a comprehensive collection of song features that are typically used for music analysis and recommendation systems. These features are extracted from the Spotify Web API and cover a wide range of musical attributes.

Spotify 12M Songs Dataset - RodolfoFigueroa
- This dataset provides a collection of song lyrics from Spotify, which can be used for natural language processing (NLP) tasks such as sentiment analysis and topic modeling.
- Dataset Description: The Spotify 12M Songs Dataset offers a vast collection of song lyrics available on the Spotify platform. This dataset enables researchers to analyze the textual content of songs and extract meaningful insights related to language use and sentiment.

Please refer to the respective datasets for more details and to access the raw data.
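
As a starting point for the EDA, the audio features listed above can be loaded into a pandas DataFrame and checked for pairwise correlation. The sketch below is only an illustration: it uses a few made-up rows in place of the Kaggle file, and the column names are assumed to match the feature names in the dataset description.

```python
import pandas as pd

# Hypothetical sample rows standing in for the Kaggle CSV; all values are made up.
songs = pd.DataFrame(
    {
        "danceability": [0.80, 0.55, 0.30, 0.72, 0.41],
        "energy": [0.90, 0.60, 0.20, 0.85, 0.35],
        "loudness": [-4.0, -7.5, -15.0, -5.0, -11.0],
        "acousticness": [0.05, 0.40, 0.92, 0.10, 0.75],
        "valence": [0.70, 0.50, 0.25, 0.65, 0.30],
    }
)

# Pearson correlation between every pair of feature columns.
corr = songs.corr()

# Show how each feature relates to energy, from most negative to most positive.
energy_corr = corr["energy"].drop("energy").sort_values()
print(energy_corr)
```

With the real data, `songs` would come from `pd.read_csv(...)` on the downloaded file, and the correlation matrix can be passed to `seaborn.heatmap` for a visual overview.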

Tech Stack

Programming Languages: Python
Libraries and Frameworks: Pandas, NumPy, Matplotlib, Seaborn, Scikit-learn, Hugging Face Transformers
Development Environment: Jupyter Notebooks, Visual Studio Code

Key Features

- Exploratory Data Analysis (EDA)
- NLP Analysis
  - Emotion Analysis: Hugging Face Transformers pipeline with SamLowe/roberta-base-go_emotions
  - LDA (Latent Dirichlet Allocation)
  - Clustering
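
The emotion analysis step can be sketched roughly as follows. This is a minimal example, not the notebook's actual code: the helper names `top_emotion` and `classify_lyrics` are hypothetical, and only the model name comes from this README. It assumes the `transformers` package and a backend such as PyTorch are installed.

```python
from typing import Dict, List

MODEL_NAME = "SamLowe/roberta-base-go_emotions"  # model named in this README


def top_emotion(scores: List[Dict]) -> str:
    """Return the label with the highest score from one pipeline result."""
    return max(scores, key=lambda item: item["score"])["label"]


def classify_lyrics(lyrics: List[str]) -> List[str]:
    """Run the emotion classifier on each lyric and keep only the top label."""
    # Imported lazily so top_emotion can be used without transformers installed.
    from transformers import pipeline

    # top_k=None asks the pipeline to return a score for every emotion label.
    classifier = pipeline("text-classification", model=MODEL_NAME, top_k=None)
    # truncation=True cuts lyrics that exceed the model's maximum input length.
    results = classifier(lyrics, truncation=True)
    return [top_emotion(scores) for scores in results]
```

Calling `classify_lyrics(["some lyric text", "another lyric"])` downloads the model on first use and returns one emotion label per input string.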

Data visualization

Sentiment score over Text Length

Evolution of song features over time
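
A trend like this can be computed by averaging each feature per release year. The sketch below shows one possible approach on made-up rows; the `year` column name and all values are assumptions for illustration, not taken from the datasets.

```python
import pandas as pd

# Hypothetical rows; "year" and the feature values are made up for illustration.
songs = pd.DataFrame(
    {
        "year": [1990, 1990, 2000, 2000, 2010, 2010],
        "energy": [0.50, 0.60, 0.65, 0.75, 0.80, 0.90],
        "acousticness": [0.60, 0.50, 0.40, 0.30, 0.20, 0.10],
    }
)

# Average each feature per release year, giving one row per year.
yearly = songs.groupby("year")[["energy", "acousticness"]].mean()
print(yearly)
```

With Matplotlib installed, `yearly.plot()` draws one line per feature against the year.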
