An advanced web scraper that fetches over 200 online tutorials and combines them into PDF for convenient offline reading.
Please support learncpp.com here: https://www.learncpp.com/about/
Pre-requisites: On Windows, download wkhtmltopdf from https://wkhtmltopdf.org/downloads.html, then install and configure it.
On Linux or macOS, install the wkhtmltopdf package using your package manager.
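Before running the scraper, it can help to confirm that the wkhtmltopdf executable is reachable. The following is a minimal sketch using only the Python standard library; the helper name `find_wkhtmltopdf` is hypothetical and not part of this project.

```python
import shutil
from typing import Optional


def find_wkhtmltopdf(name: str = "wkhtmltopdf") -> Optional[str]:
    """Return the full path of the executable if it is on PATH, else None.

    Hypothetical helper for illustration; not part of this repository.
    """
    return shutil.which(name)


path = find_wkhtmltopdf()
if path is None:
    print("wkhtmltopdf was not found on PATH; install it first.")
else:
    print(f"wkhtmltopdf found at: {path}")
```

If the check reports that the executable is missing on Windows, the install directory probably still needs to be added to PATH.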
- Local (recommended)
Requires Python 3.10.
git clone https://github.com/amalrajan/learncpp-download.git
cd learncpp-download
pip install -r requirements.txt
- Docker
docker pull amalrajan/learncpp-download:latest
docker run --rm --name=learncpp-download --mount type=bind,destination=/app/downloads,source=<host-downloads-path> --shm-size=1.14gb amalrajan/learncpp-download
Replace <host-downloads-path> with a local path on your system where you want the files to be downloaded.
Run the scraper:
python main.py
The PDF files are saved to a newly created "downloads" folder, directly under the "source" directory.
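To verify a run, you could list the PDFs that were produced. This is a small sketch, assuming the output folder is reachable as `downloads` relative to your working directory (adjust the path to match your setup); `list_pdfs` is a hypothetical helper, not part of this project.

```python
from pathlib import Path


def list_pdfs(folder: str = "downloads") -> list[str]:
    """Return the sorted names of the .pdf files directly inside `folder`.

    Returns an empty list if the folder does not exist yet.
    The default folder name is an assumption about where output lands.
    """
    root = Path(folder)
    if not root.is_dir():
        return []
    return sorted(p.name for p in root.glob("*.pdf"))


for name in list_pdfs():
    print(name)
```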
Thanks to Ray (https://github.com/ray-project/ray), the processing time on a decent system is now reduced to ~5 seconds from the previous 300 seconds. Be aware that this uses a lot more CPU and memory.
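The speedup comes from processing many pages at once instead of one after another. The sketch below illustrates that fan-out pattern with the standard library's `concurrent.futures` rather than Ray, so it runs without extra dependencies; it is not this project's actual code. `convert_page` and `convert_all` are hypothetical names, and the conversion step is a placeholder that only derives a file name.

```python
from concurrent.futures import ThreadPoolExecutor


def convert_page(url: str) -> str:
    """Placeholder for "fetch one tutorial and render it to PDF".

    Hypothetical: it only derives an output file name from the URL,
    so the sketch runs offline.
    """
    slug = url.rstrip("/").rsplit("/", 1)[-1]
    return f"{slug}.pdf"


def convert_all(urls: list[str], workers: int = 8) -> list[str]:
    """Run convert_page over all URLs concurrently, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(convert_page, urls))


print(convert_all(["https://example.com/lesson-1/", "https://example.com/lesson-2/"]))
```

More workers finish sooner but hold more pages in flight at once, which is the CPU and memory trade-off noted above.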
Feel free to open a new issue here: https://github.com/amalrajan/learncpp-download/issues. Please attach the console logs.