crawldb modifications

2012-02-27 Thread Charles Thomas
Is there a way to clear out the various databases that Nutch uses (e.g. crawldb)? I did some testing which injected a lot of URLs into the DB that I want to clear out as I move toward production. Thanks! CT -- View this message in context:

Re: crawldb modifications

2012-02-27 Thread remi tassing
What do in this case is to erase the db, use the.command mergesegs with -filter option and then updatedb. I would.love to know if there is a simpler way Remi On Monday, February 27, 2012, Charles Thomas ctho...@wisc.edu wrote: Is there a way to clear out the various databases that Nutch uses