The nutch data files are pretty opaque, and even "strings" can't extract anything except the occasional URL. Is there any code to dump the contents of the various files in a human readable form?
-- http://www.linkedin.com/in/paultomblin
The nutch data files are pretty opaque, and even "strings" can't extract anything except the occasional URL. Is there any code to dump the contents of the various files in a human readable form?
-- http://www.linkedin.com/in/paultomblin