python3 wikipedia_v2.py
--dir directory for saving files
--namespace
--titlesdir directory containing page_titles.txt
--download whether to download files
--f (if not downloading) directory of the unzipped .xml file
--idx index of the file to process (0 - 215)
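
The flags listed above could be parsed with argparse roughly as follows. This is only a sketch, not the actual code of wikipedia_v2.py: the argument types, treating --download as a boolean switch, the rule that --f is needed when --download is absent, and the helper names build_parser and parse_cli are all assumptions made for illustration. The example values passed at the bottom (out, titles, dump) are hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the documented flags (types are assumed)."""
    parser = argparse.ArgumentParser(prog="wikipedia_v2.py")
    parser.add_argument("--dir", help="directory for saving files")
    # The source does not describe --namespace, so no meaning is assumed here.
    parser.add_argument("--namespace", help="namespace (undocumented in the source)")
    parser.add_argument("--titlesdir", help="directory containing page_titles.txt")
    # Assumption: --download is a switch rather than a flag that takes a value.
    parser.add_argument("--download", action="store_true",
                        help="whether to download files")
    parser.add_argument("--f", help="(if not downloading) directory of the unzipped .xml file")
    # The documented index range is 0 - 215, inclusive.
    parser.add_argument("--idx", type=int, choices=range(0, 216), metavar="N",
                        help="index of the file to process (0 - 215)")
    return parser


def parse_cli(argv):
    """Parse argv and enforce the assumed rule that --f is needed without --download."""
    parser = build_parser()
    args = parser.parse_args(argv)
    if not args.download and args.f is None:
        parser.error("--f is required when --download is not set")
    return args


if __name__ == "__main__":
    # Hypothetical values, for illustration only.
    args = parse_cli(["--dir", "out", "--titlesdir", "titles",
                      "--f", "dump", "--idx", "3"])
    print(args.idx, args.download)
```

With this layout, an out-of-range --idx or a missing --f (when --download is not given) makes argparse exit with a usage message instead of failing later during processing.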
bill10 / wikipedia_parser
This project is forked from hanguo97/wikipedia_parser