Giter VIP home page Giter VIP logo

lyragen's People

Contributors

bdkiran avatar

Watchers

 avatar  avatar

lyragen's Issues

Add Ability to Scrape Songs by Artist

Add logic to the application that

  • Fetches all songs from az lyrics from a certain artist

  • Fetches all lyrics from the artist

We need this to have clean data regarding Artist Name and Song Name(currently all lowercase)

Group Duplicate Lyrics in Same Document

To resolve an issue of duplicate lyrics being returned, the best solution is to return store the duplicate lyrics in elastic as a single document. This document will represent if there are multiple lyrics by representing the multiple positions that they occur within the song.

  • Duplicate Song lyrics are detected

  • They are stored in a data structure that tracks the occurrence by line number

  • These data structures are appropriately transformed to be stored in elasticsearch

PostgreSQL Main Storage for songs

Convert songs meta data storage to PostgreSQL DB instead of files.
Table songs
-song_id
-title
-artist
-album
-year
-lyrics_fetched

  • Drop table

  • Script completed, needs to be rerun to on both folders

Uploading Strings not valid Chars

Chars that are scraped from A-Z Lyrics that contain " ' " are being transformed to odd characters.
When scraping, these chars need to be cleaned then uploaded to elasticsearch

Add Multiple Song Upload

Using the json data structure dependency. Be able to update the database with multiple songs

  • Add retry mechanism to accommodate when the API fails

  • Update json document to reflect that a song was successfully updated

  • Implement proper mechanisms to avoid rate limiting/bans from AZ lyrics

Update README

Update readme to:

  • User proper markdown

  • Explain what the project is for

  • Instructions for downloading and dependencies

Get Song Lyrics From DB

Song Lyrics should be able to be fetched from DB instead of from files.

To avoid Rate limiting issues:

  • Either limit by amount

  • Limit by artist

Clean Up JSON Documents

Have one JSON document used to feed in new songs

  • Contains information Artist and Song Title

  • Contain information if the track was successfully uploaded to db

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.