Hi, I'm wondering whether the index that Nutch creates is compressed in any way. I have searched through the mailing list entries and can't find anything about compression. I found a mention of it on Sami Siren's blog, but it didn't really answer my question.
My question is: does Nutch compress the index after the crawl, or is this something Lucene does? Or do you need to add a compression plugin (if one exists) to nutch-site.xml? Or are the segments compressed? If anybody can shed some light on this, that would be great.

Thanks,
Paul
