Machine Learning model that predicts which page best to choose to reach the goal page.
This is a final project for "Introduction to Machine Learning by Università di Macerata" course. Created by Dmytro Zakharov, Daniil Korenkov, Kvaratskheliia David.
You can download it here or from the presentation
folder.
internal/dataset_generator
— logic responsible for generating the dataset. Main function here isweb_clicker.py
which launches the browser from the web driver, open the Six Degrees of Wikipedia website and mines the data. In theinputs
file we included the list of words that is used for the model and forming the input.output
is a place where the generated table is stored.internal/model
— place where we trained and saved our model.interface
— scripts that launches the colorful CLI for interacting with a model.
From the project root folder type in:
python3 main.py -m generate
to launch dataset minerpython3 main.py -m six-degrees-chart
to show distances distribution from the Six Degrees of Wikipedia websitepython3 main.py -m generator-chart
to show distances distribution for our datasetpython3 main.py -m generate-features
to convert generated dataset to one needed for the model (with features)