Spotify-Data-Analysis

Spotify dataset is downloaded from Kaggle. This database contains information about artists, musical charecteristics, year and popularity of 1.7 L songs of Spotify for past 100 years.

We've tried to quantify relationships between different features and their impact on songs getting popularity.

We've used libraries like pandas, numpy, scikitlearn, matplotlib and seaborn to carry out data preprocessing, data visualization and modelling.

This study has provided us good hands on and insights for:

Linear Regression
Logistic Regression
Decision Tree Classification
KNN Classification
random Forest Classification

From models and visuals, it can be definitely said that the popularity is some what biased towards the year of the song. This is unclear how popularity has been measured, wether it was song's popularity in the years of their release or in today's date. It is obvious that recent songs would be more popular among people from 2020.

Future scope is to develop NN model using TensorFlow and will be uploaded soon.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
Spotify_Models.ipynb		Spotify_Models.ipynb
Spotify_Preprocessing.ipynb		Spotify_Preprocessing.ipynb
Spotify_Visualization.ipynb		Spotify_Visualization.ipynb
data.csv		data.csv
data_preprocessed.csv		data_preprocessed.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spotify-Data-Analysis

About

Releases

Packages

Languages

Yash-Parikh/Spotify-Data-Analysis

Folders and files

Latest commit

History

Repository files navigation

Spotify-Data-Analysis

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages