A project that scrapes price data from paraguayan supermarkets
Create a dataset of products stocked in paraguayan supermarkets, make it easily accessible for anyone to download and make use of technologies I have recently learned.
├── README.md
├── backend
├── docker-compose.yml
├── linkgenerator
└── product_scraper
backend
: a django-rest
API for storing all of the information
linkgenerator
: simple requests
scripts that extracts all links from supermarkets' landing pages, stores urls in the backend
product_scraper
: a scrapy
project that has spiders that retrieve all urls stored in backend
and finds all products in the supermarket website
You should be able to run every component of the project by running the following command inside the component directory:
docker build -t <component_name> .
Note: Remember to create the .env from .env.sample file in the backend directory before building
Run Django+Postgres:
docker compose up --build
- Add logs
- Add frontend
- Add
Dockerfile
to every servicelinkgenerator
product_scraper
- Add rest of services to
docker-compose
filelinkgenerator
product_scraper