DO NOT REPLY TO THIS EMAIL, BUT PLEASE POST YOUR BUG-RELATED COMMENTS THROUGH THE WEB INTERFACE AVAILABLE AT <http://issues.apache.org/bugzilla/show_bug.cgi?id=32921>. ANY REPLY MADE TO THIS MESSAGE WILL NOT BE COLLECTED AND INSERTED IN THE BUG DATABASE.
http://issues.apache.org/bugzilla/show_bug.cgi?id=32921

Summary: Some utilities for a compact sparse filter
Product: Lucene
Version: CVS Nightly - Specify date in submission
Platform: Other
OS/Version: other
Status: NEW
Severity: enhancement
Priority: P1
Component: Search
AssignedTo: lucene-dev@jakarta.apache.org
ReportedBy: [EMAIL PROTECTED]

Two files are attached that might form the basis for an alternative filter implementation that is more memory efficient than one bit per document when fewer than about 1/8 of the documents pass through the filter.

The document numbers are stored in RAM as VInts, as defined in the Lucene index format. Each VInt encodes the difference between two successive document numbers, much like a PositionDelta in the Positions:
http://jakarta.apache.org/lucene/docs/fileformats.html

The getByteSize() method can be used to verify the compression once a SortedVIntList is constructed. The precise conditions under which this is more memory efficient than one bit per document are not easy to specify in advance.
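
For readers without the attachments at hand, the delta encoding described above can be sketched roughly as follows. This is not the attached SortedVIntList code. The class name DeltaVIntSketch and its encode/decode methods are hypothetical and exist only for illustration. The only thing taken from Lucene is the VInt byte layout: seven data bits per byte, low-order bits first, with the high bit set when more bytes follow. Since every VInt occupies at least one byte (eight bits), the list can only beat one bit per document when at most about 1/8 of the documents pass, which is where the rough threshold comes from. Larger gaps between document numbers need more bytes, so the exact break-even point depends on how the document numbers are distributed.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Illustrative sketch only; not the SortedVIntList attached to this issue.
public class DeltaVIntSketch {

    /** Encodes non-decreasing, non-negative doc numbers as VInt deltas. */
    public static byte[] encode(int[] sortedDocs) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int doc : sortedDocs) {
            int delta = doc - previous;
            if (delta < 0) {
                throw new IllegalArgumentException("doc numbers must be sorted and non-negative");
            }
            // VInt layout: 7 data bits per byte, low-order bits first;
            // the high bit is set when more bytes follow.
            while ((delta & ~0x7F) != 0) {
                out.write((delta & 0x7F) | 0x80);
                delta >>>= 7;
            }
            out.write(delta);
            previous = doc;
        }
        return out.toByteArray();
    }

    /** Decodes count doc numbers by summing the VInt deltas. */
    public static int[] decode(byte[] bytes, int count) {
        int[] docs = new int[count];
        int pos = 0;
        int previous = 0;
        for (int i = 0; i < count; i++) {
            int delta = 0;
            int shift = 0;
            byte b;
            do {
                b = bytes[pos++];
                delta |= (b & 0x7F) << shift;
                shift += 7;
            } while ((b & 0x80) != 0);
            previous += delta;
            docs[i] = previous;
        }
        return docs;
    }

    public static void main(String[] args) {
        int[] docs = {3, 10, 200, 100000};
        int maxDoc = 1000000; // assumed index size, for comparison only
        byte[] packed = encode(docs);
        System.out.println("VInt list bytes: " + packed.length);
        System.out.println("one bit per doc bytes: " + (maxDoc + 7) / 8);
        System.out.println("round trip ok: " + Arrays.equals(docs, decode(packed, docs.length)));
    }
}
```

In this sketch the decoder must be told how many document numbers were stored. A real implementation would presumably keep that count, and the byte size, alongside the byte array.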