Big Data Project - Netflix, IMDb, Rotten Tomatoes #1

This is our first Big Data project. Its aims were to analyze a movie ratings dataset, to integrate multiple databases using record linkage techniques, and to understand whether critics are too critical compared with users. We focused on a non-relational database management system, MongoDB.

Databases:
- Netflix
- IMDb
- Rotten Tomatoes
- TMDb

You can find:
- Data Ingestion Techniques
- Data Cleaning
- Scraping Data
- Use of PyMongo
- Data Analysis
- Data Processing
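
As an illustration of the record linkage step mentioned above, here is a minimal sketch that uses only the Python standard library. It is not the project's actual code: the record layout (dicts with `title` and `year` keys), the function names `normalize_title` and `link_records`, and the 0.9 similarity threshold are all assumptions chosen for the example. It blocks candidate pairs by release year and then compares normalized titles.

```python
import re
from difflib import SequenceMatcher


def normalize_title(title):
    """Lowercase, replace punctuation with spaces, and collapse whitespace."""
    cleaned = re.sub(r"[^a-z0-9 ]", " ", title.lower())
    return " ".join(cleaned.split())


def link_records(left, right, threshold=0.9):
    """Return (left_index, right_index) pairs judged to be the same movie.

    Assumes every record is a dict with "title" and "year" keys
    (a hypothetical layout, not necessarily the project's schema).
    """
    # Blocking step: only compare records that share a release year.
    by_year = {}
    for j, rec in enumerate(right):
        by_year.setdefault(rec["year"], []).append(j)

    links = []
    for i, rec in enumerate(left):
        title = normalize_title(rec["title"])
        best_j, best_score = None, threshold
        for j in by_year.get(rec["year"], []):
            candidate = normalize_title(right[j]["title"])
            score = SequenceMatcher(None, title, candidate).ratio()
            # Keep the most similar candidate at or above the threshold.
            if score >= best_score:
                best_j, best_score = j, score
        if best_j is not None:
            links.append((i, best_j))
    return links
```

Calling `link_records(netflix_rows, imdb_rows)` on two lists of such dicts returns index pairs that can then be used to merge the matched documents. A lower threshold links more records at the cost of more false matches.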
We used a MongoDB cluster to achieve good performance in terms of computational power.
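
To give an idea of how the critics-versus-users question could be pushed down to MongoDB through PyMongo, here is a hedged sketch. The field names `critic_score` and `user_score`, the function names, and the assumption that both scores are already on the same scale are hypothetical and not taken from the project; the connection URI, database name, and collection name are left as parameters.

```python
def critic_user_gap_pipeline():
    """Build an aggregation pipeline for the mean critic-minus-user score gap.

    Assumes hypothetical numeric fields "critic_score" and "user_score"
    that are already normalized to the same scale.
    """
    return [
        # Keep only documents where both scores are numeric.
        {"$match": {
            "critic_score": {"$type": "number"},
            "user_score": {"$type": "number"},
        }},
        # Average the per-movie difference over the whole collection.
        {"$group": {
            "_id": None,
            "mean_gap": {"$avg": {"$subtract": ["$critic_score", "$user_score"]}},
            "movies": {"$sum": 1},
        }},
    ]


def mean_critic_user_gap(uri, db_name, collection_name):
    """Run the pipeline on a MongoDB deployment and return the result list."""
    # Imported here so the pipeline can be built without PyMongo installed.
    from pymongo import MongoClient

    client = MongoClient(uri)
    try:
        collection = client[db_name][collection_name]
        return list(collection.aggregate(critic_user_gap_pipeline()))
    finally:
        client.close()
```

A negative `mean_gap` would suggest that critics score movies lower than users on average. Running the aggregation on the server avoids pulling every document into Python.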
Contributors: Lorenzo Famiglini and Giorgio Bini