This repository encompasses the project developed for the Advanced Databases course, which aimed to compare the performance of SQL and NoSQL databases.
- Alexandre Sobreira
- André Dias
- Miguel Catarro
- Tiago Rodrigues
- Márcia Barros
This project focused on comparing a SQL and NoSQL database regarding data structure, querying, indexing and optimizations.
- Relational Database: SQLite
- NoSQL Database: MongoDB
The data for this project had to be arbitrarily chosen considering that it must contain three distinct excel sheets with at least a common column.
The dataset chosen was the "World Happiness Report" from Kaggle, which consists in five csv files with variables related to the happiness index of given countries over the span of 5 years.
The data files were slighlty modified to homogenize their format.
- Select the dataset and database schemas
- Create the databases (SQL and NoSQL)
- Create 6 queries for each database
- Create indexes, optimize the queries and database schemas and compare performance results
This project was developed using Python and the following libraries: Pandas, SQLite3 and PyMongo.