Lucene handles all compression on the index.
The most recent details on this can always be found on this page (within the
Lucene site): http://lucene.apache.org/java/docs/fileformats.html
----- Original Message ----
From: Paul Liddelow <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]
Sent: Sunday, April 15, 2007 3:28:00 AM
Subject: Index compression
Hi
I'm just wondering about the index that Nutch creates and whether it
is compressed in any way. I have checked through all the mailing list
entries and can't find anything about compression. I found something
on Sami Siren's blog that mentioned it but it didn't really answer my
question.
My question is: does Nutch compress the index after the crawl? Or is
this something Lucene does? Or do you need to call a compression
plugin (if one exists) to nutch-site.xml? Or are the segments
compressed?
If anybody can shed some light on this matter that would be great.
Thanks
Paul
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
_______________________________________________
Nutch-general mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/nutch-general