h4ck3rm1k3 / nutch-mongdb-parser Goto Github PK
View Code? Open in Web Editor NEWAllows the easy seeding of urls from Mongodb into Nutch. This is similar in nature to that of the DmozParser that comes with Nutch. This provides a way to bootstrap and seed Nutch with data coming directly from Mongodb. The injector add urls from a specified mongodb to the crawldb of your choice.
License: Apache License 2.0