eminem_nlp's Introduction

Building an NLP ETL Pipeline and Conducting Analysis on Eminem Lyrics

This project builds a pipeline in Python to clean and pre-process song lyrics for analysis. Eminem's lyrics were scraped on December 22, 2020 using lyricsgenius, a Python library by Johnwmillr.
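The cleaning step can be pictured with a minimal sketch like the one below. It is not the code from the notebooks: the function name `clean_lyrics`, the regular expressions, and the assumption that the raw lyrics contain bracketed section markers such as "[Chorus]" are all illustrative, and the sample text is a made-up placeholder rather than real lyrics.

```python
import re

# Bracketed section markers such as "[Chorus]" (assumed format, not confirmed by the repo)
SECTION_TAG = re.compile(r"\[[^\]]*\]")
# Lowercase words and digits, allowing in-word apostrophes such as "don't"
TOKEN = re.compile(r"[a-z0-9]+(?:'[a-z0-9]+)*")


def clean_lyrics(raw: str) -> list[str]:
    """Strip section tags, lowercase the text, and split it into word tokens."""
    without_tags = SECTION_TAG.sub(" ", raw)
    return TOKEN.findall(without_tags.lower())


# Placeholder text standing in for one scraped song
sample = "[Chorus]\nHello, world! Don't stop."
print(clean_lyrics(sample))
```

This sketch only handles straight apostrophes and ASCII letters; the actual notebooks may also filter by language (for example with langdetect) or rely on NLTK tokenizers.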

Analysis performed:
  • WordCloud (EminemWC.png)
  • Text Analysis
  • Sentiment Analysis
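As a rough illustration of the text-analysis step, the sketch below counts the most frequent words after dropping stop words. The helper name `top_words`, the tiny stop-word set, and the sample tokens are assumptions made for this example; the notebooks may use NLTK's stop-word list and a different approach.

```python
from collections import Counter

# Tiny illustrative stop-word set (assumption); a real run would likely use a fuller list
STOP_WORDS = {"the", "a", "and", "i", "to", "it"}


def top_words(tokens, n=3):
    """Return the n most frequent tokens as (word, count) pairs, ignoring stop words."""
    counts = Counter(t for t in tokens if t not in STOP_WORDS)
    return counts.most_common(n)


# Placeholder tokens standing in for cleaned lyrics
tokens = ["the", "beat", "and", "the", "rhyme", "beat", "rhyme", "beat", "flow"]
print(top_words(tokens, n=2))
```

A word-to-count mapping like this is also the kind of input a word cloud is built from; the wordcloud library can accept one through `WordCloud.generate_from_frequencies`, though whether this project does so is not stated here.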

Frameworks

  • lyricsgenius - Library for scraping data from Genius.com
  • Pandas - Library for Dataframe manipulation
  • json - JSON processing library
  • NumPy - Library for numerical computing on large arrays, with high-level mathematical functions
  • langdetect - Language detection library
  • NLTK - Natural Language Toolkit, a library for natural language processing
  • PIL - Image processing library
  • Wordcloud - Library used for making wordclouds
  • Matplotlib - Library used for visualizations

How to run

You can clone the whole repo and run the .ipynb files in Jupyter Notebook or Google Colab. Comments and detailed explanations of the code are included in these files as well.

Files

NOTE: Running the files above saves the resulting output files in the working directory. In this repository, however, all output files are stored in the files folder to keep things organized.

Eminem WordCloud

eminem_nlp's People

Contributors

thebadcoder96
