This project is a simple crawler for the autoscout24 website. It crawls up to 400 cars per brand/model/year combination and stores the data in a versioned directory as parquet. The data is then analyzed and visualized using some simple plots in my workbench. Note I use vscode for this so it's not a fancy notebook. Feel free to crawl yourself and visualise your own data. Below is an example, more can be found in the images folder
This is your new Kedro project with Kedro-Viz setup, which was generated using kedro 0.19.5
.
Take a look at the Kedro documentation to get started.
-
Clone the repository:
git clone https://github.com/pascalwhoop/as24_crawl.git
-
Navigate to the project directory:
cd as24_crawl
-
Install the dependencies:
pip install -r requirements.txt
-
Run the project:
kedro run