DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://issues.apache.org/bugzilla/show_bug.cgi?id=31149>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://issues.apache.org/bugzilla/show_bug.cgi?id=31149 [PATCH] to store binary fields with compression ------- Additional Comments From [EMAIL PROTECTED] 2004-09-29 18:36 ------- hi, the zip file added today contains the improved version for the compression patch based on the latest source from cvs with new features discussed on the mailing list implemented. The patch contains three diff files (Field.diff, FieldsReader.diff and FieldsWriter.diff) and one new Testcase to test the compression functionality. This patch does allow now compression either on binary or string value fields. There also is a small cleanup in FieldsReader and FieldsWriter using static members referencing the bit values which makes the code more readable (Doug asked for it). The Field class now stores all 3 possible data values (reader, string or byte[]) within single member. This change was also asked by Doug and makes things easier to handle, but is not directly related to compression. Activating compression, the index size can be reduced to 60% of the original size when storing whole documents within the index. Increasing index time round about 50-70%. Regarding the query performance, i saw no differences between an compressed or uncompressed index. What may take a bit longer is fetching the hit documents. All lucene test cases works well. So maybe the lucene committers can have a look on it and decide if it will be part of the next version. If there are any questions regarding the changes, leave a note on the developer list. regards and fun with it bernhard --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]