Well, you can dump the crawldb using the bin/nutch readdb command. You'd still 
need to parse the output youself to get a decent list of URL's.

> Hi guys,
> 
> I was wondering if there is a quick method to dump all urls of a merged
> index (ie a production index).
> I want to use them them  for a 'fresh' seeding of a new crawldb

Reply via email to