Is there a way to clear out the various databases that Nutch uses (e.g.
crawldb)? I did some testing which injected a lot of URLs into the DB that
I want to clear out as I move toward production.
Thanks!
CT
--
View this message in context:
What do in this case is to erase the db, use the.command mergesegs with
-filter option and then updatedb.
I would.love to know if there is a simpler way
Remi
On Monday, February 27, 2012, Charles Thomas ctho...@wisc.edu wrote:
Is there a way to clear out the various databases that Nutch uses
2 matches
Mail list logo