You can download the entire IMDB dataset and extract the files to the data
folder. Once this is done, rename the files to names that make more sense when you load the data in Python using Pandas.
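The renaming step can be scripted instead of done by hand. The following is a minimal sketch, not the exact procedure used here: the `data` folder name comes from the text above, while the old and new file names in `RENAMES` and the helper `rename_files` are hypothetical placeholders that you would replace with the names of the files you actually extracted.

```python
from pathlib import Path

# Hypothetical mapping of extracted file names to friendlier names.
# Adjust both sides to match the files you actually have in the data folder.
RENAMES = {
    "title.basics.tsv": "titles.tsv",
    "title.ratings.tsv": "ratings.tsv",
}


def rename_files(data_dir, renames):
    """Rename files inside data_dir and return the list of new names created."""
    data_dir = Path(data_dir)
    renamed = []
    for old_name, new_name in renames.items():
        source = data_dir / old_name
        target = data_dir / new_name
        # Skip files that are missing or whose target name is already taken.
        if source.exists() and not target.exists():
            source.rename(target)
            renamed.append(new_name)
    return renamed


if __name__ == "__main__":
    print(rename_files("data", RENAMES))
```

Files that are missing, or whose new name already exists, are skipped, so the script can be run again without errors.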
- Python
- Pandas
- Modin
- Matplotlib
On Windows, you need the Windows Subsystem for Linux (WSL) to run Pandas with Modin, because Modin uses a dependency (ray) that was not available on Windows at the time of writing. You might need to install the update on your Windows 10 machine or enable WSL from Add/Remove Programs (under "Turn Windows features on or off"). After WSL is enabled, you can use the same commands as on Linux to install the other dependencies/prerequisites.
Python is installed by default on all major Linux distributions. You can then install the other dependencies using pip
commands.
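Before moving on, it can help to confirm that the prerequisites listed above are actually installed. The sketch below is one way to check, using only the standard library; the helper name `missing_packages` is illustrative, and the suggested `pip install` line is a generic assumption rather than the exact command used for this setup.

```python
import importlib.util

# Import names of the prerequisites listed above (Python itself is implied).
REQUIRED = ["pandas", "modin", "matplotlib"]


def missing_packages(names):
    """Return the top-level package names that cannot be found by the import system."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Missing:", ", ".join(missing))
        # Generic suggestion (assumption): install each missing package with pip.
        print("Try: pip install " + " ".join(missing))
    else:
        print("All prerequisites are importable.")
```

The check only looks packages up and does not import them, so it is quick to run even where importing Modin would start its backend.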