Hi Paul, yeah there is a dump command
bin/nutch readlinkdb crawl/linkdb/ -dump dumpdir You can also dump the CrawlDB, but I dont know if the complete data are dumpable and this is usefull for you... HTH Mario Paul Tomblin wrote: > The nutch data files are pretty opaque, and even "strings" can't extract > anything except the occasional URL. Is there any code to dump the contents > of the various files in a human readable form? > > -- Mario Schröder | http://www.finanz-checks.de Office: +49 361 2152062 Phone: +49 34464 62301 Cell: +49 163 27 09 807 http://www.xing.com/go/invite/6035007.9c143c
