Movie Recommender is a simple recommender system that seeks to generate top recommendations for each user based on their past references.
- Python 3.6
- Pandas 0.24
- Pyspark 2.4
Explored movies.csv, ratings.csv and tags.csv. Removed unnecessary columns and calculated weighted average rating of each user and movie based on IMDB's formula. Analysed tag frequency, top movie within each genre and other data visualization. Implemented baseline, collaborative filtering and content based approaches.
Scaling up the pipeline using pyspark. For matrix factorization, ALS is used. The best RMSE is 0.880.