Find the 100 most frequent URLs in a large URL file, together with their occurrence counts, using no more than 1 GB of memory.
In this project, a Python script, createURL.py, is used to generate URL.txt, which contains a random number of different URLs. Using createURL.py, you can generate a URL record of about 20 GB. In URL.txt, each URL occupies one line.
- Requirements: Python 3 and Go 1.12.1
- OS: Linux
- Clone the URLCount repository.
git clone https://github.com/AkinoRito/URLCount.git
cd URLCount
- Use createURL.py to generate the dataset.
python createURL.py
Note: Generating URLs may take some time.
- Run the Go program
go run main.go utils.go
- Execute the test script
go test -v