Two years back we had a client who wanted a quick solution for getting all the products on several online shops. Spcifically, we had to find all the products' UPCs on the specified websites.
- scraping a page
- HTTPoison
- Floki
- scraping multiple pages
- explain the algorithm
- Stream
- Eager vs Lazy
- Stream.resource(start, next, after)
- The subtle, but important difference between Concurrency and Parallelism
- Measure execution time
- Checking available CPUs with
:observer.start()
- Crawl concurrently with
Task.async_stream
- Embarassingly parallel?