Lucene handles all compression on the index. The most recent details on this can always be found on this page (within the Lucene site): http://lucene.apache.org/java/docs/fileformats.html
----- Original Message ---- From: Paul Liddelow <[EMAIL PROTECTED]> To: [EMAIL PROTECTED] Sent: Sunday, April 15, 2007 3:28:00 AM Subject: Index compression Hi I'm just wondering about the index that Nutch creates and whether it is compressed in any way. I have checked through all the mailing list entries and can't find anything about compression. I found something on Sami Siren's blog that mentioned it but it didn't really answer my question. My question is: does Nutch compress the index after the crawl? Or is this something Lucene does? Or do you need to call a compression plugin (if one exists) to nutch-site.xml? Or are the segments compressed? If anybody can shed some light on this matter that would be great. Thanks Paul
