Hi Paul,

Yes, there is a dump command:

bin/nutch readlinkdb crawl/linkdb/ -dump dumpdir
You can also dump the CrawlDB, but I don't know whether all of its data
can be dumped or whether this is useful for you.
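
Here is a rough sketch that covers both dumps in one go. It assumes you
run it from the Nutch install directory and that your crawl data lives
under crawl/. The variable names, the run/dump_all helpers and the
dumpdir/linkdb and dumpdir/crawldb output paths are only my placeholders,
not anything Nutch requires. By default it only prints the commands; set
RUN=1 to actually execute them.

```shell
#!/bin/sh
# Sketch only: dump the Nutch LinkDB and CrawlDB as plain text.
# Assumed layout (adjust to your setup): bin/nutch, crawl/linkdb, crawl/crawldb.
NUTCH="${NUTCH:-bin/nutch}"
CRAWL_DIR="${CRAWL_DIR:-crawl}"
OUT_DIR="${OUT_DIR:-dumpdir}"   # hypothetical output location

# Print the command (default) or execute it when RUN=1.
run() {
    if [ "${RUN:-0}" = "1" ]; then
        "$@"
    else
        echo "$@"
    fi
}

dump_all() {
    # LinkDB: the command from above
    run "$NUTCH" readlinkdb "$CRAWL_DIR/linkdb" -dump "$OUT_DIR/linkdb"
    # CrawlDB: one entry per URL known to the crawl
    run "$NUTCH" readdb "$CRAWL_DIR/crawldb" -dump "$OUT_DIR/crawldb"
}

dump_all
```

As far as I know the dump runs as a Hadoop job, so you will probably
need output directories that do not exist yet.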

HTH

Mario

Paul Tomblin wrote:
> The nutch data files are pretty opaque, and even "strings" can't extract
> anything except the occasional URL.  Is there any code to dump the contents
> of the various files in a human readable form?
>
>   

-- 

Mario Schröder | http://www.finanz-checks.de
Office: +49 361 2152062
Phone: +49 34464 62301 Cell: +49 163 27 09 807
http://www.xing.com/go/invite/6035007.9c143c
