Giter VIP home page Giter VIP logo

spotify_music_analysis's Introduction

Spotify_Music_Analysis

Summary

Music industry has revolutionized since the start of streaming services. Spotify is a top streaming service with over 50 million tracks and 280 million monthly active users. We have already seen how personalized recommendation has brought millions in users and profits to the company. Algorithms like Collaborative Filtering started the rise of using Machine learning to help personalize content for users on streaming platforms. The objective of this paper is a little different, we want to see what constitutes for a “Hit” or “Flop” song on Spotify, we will also try to predict if a song is “Hit” or “Flop” using various algorithms and finally, we will recommend older songs based on the audio features of current “Hit” songs. We used Accuracy to evaluate the performances of our models. We found one algorithm stood out more than the rest and Instrumentalness is a feature that makes a song “Hit” or “Flop” and in the end, We recommended some old songs that has audio features of current “Hit” songs.

  1. Introduction

Spotify allows users to scrape data and audio features for each of its tracks, in our dataset we have audio features from various kinds of music and of different years. Since music changes so rapidly, we will see over the years what has been a factor in making a song “Hit” or “flop”. We will generate a classifier to predict outcomes(“Hit/Flop”) and other algorithms to help us deduce what features make a “Hit” or “Flop” song on Spotify and in the end, we will generate a list of Old songs which are similar to the current “Hit” songs.

  1. Data Description

"Spotifydata.csv" is the dataset with the latest tracks and its audio features and “dataset_of_00s” and “dataset_of_10s” are datasets with older tracks from Years 2000’s and 2010’s. We have combined the “dataset_of_00s” and dataset_of_10s” into one dataset. The datasets have 17 columns related to a song’s features such as mode, key, etc. The Spotifydata.csv has an additional column known as “target” where the value is 1 if “Hit” and 0 for “Flop”. (Read more about the features here: https://developer.spotify.com/documentation/web-api/reference/tracks/get-audio-features/)

  1. Goal

There are three main objectives we want to achieve from the project

  1. We want to able to classify if a Song is “Hit” or “Flop”.

  2. We want to quantify the importance of the audio features and how it affects in classifying a “Hit” or “Flop” song.

  3. We want to recommend “Hit” songs from the past based on the audio features of “Hit” songs of the present.

  4. Conclusion

We used Random Forest to classify the songs and we saw It is easier to predict “Hit” songs of the past than the present based on its audio features. We had 79% accuracy while predicting the outcome of latest music. Whereas 86% accuracy while predicting the outcome of older music. Most important audio feature from songs from the past or present is Instrumentalness of the music. We also found list of old songs that may have similar audio features to the present “Hit” songs.

spotify_music_analysis's People

Contributors

sumeetksingh avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.