http://issues.apache.org/bugzilla/show_bug.cgi?id=32921

           Summary: Some utilities for a compact sparse filter
           Product: Lucene
           Version: CVS Nightly - Specify date in submission
          Platform: Other
        OS/Version: other
            Status: NEW
          Severity: enhancement
          Priority: P1
         Component: Search
        AssignedTo: lucene-dev@jakarta.apache.org
        ReportedBy: [EMAIL PROTECTED]


The two attached files might form the basis for an alternative
filter implementation that uses less memory than one bit per
document when fewer than about 1/8 of the documents pass the filter.
 
The document numbers are stored in RAM as VInts, as defined in the
Lucene index format. Each VInt encodes the difference between two
successive document numbers, much like a PositionDelta in the Positions:
http://jakarta.apache.org/lucene/docs/fileformats.html
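
To illustrate the encoding described above, here is a minimal sketch of
delta-encoding a sorted list of document numbers as VInts. It is not the
attached code: the class and method names (VIntDeltaSketch, encode, decode)
are hypothetical, and storing the first document number as a delta from 0
is an assumption. The byte layout follows the VInt definition in the file
formats document: seven data bits per byte, low-order bits first, with the
high bit set when more bytes follow.

```java
import java.io.ByteArrayOutputStream;
import java.util.Arrays;

// Hypothetical sketch, not the attached SortedVIntList.
class VIntDeltaSketch {

    // Encode ascending doc numbers as VInt-coded gaps.
    // Assumption: the first doc number is stored as a gap from 0.
    static byte[] encode(int[] sortedDocs) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        int previous = 0;
        for (int doc : sortedDocs) {
            int delta = doc - previous;
            if (delta < 0) {
                throw new IllegalArgumentException("doc numbers must be sorted");
            }
            // Emit 7 bits at a time, setting the high bit while more bytes follow.
            while ((delta & ~0x7F) != 0) {
                out.write((delta & 0x7F) | 0x80);
                delta >>>= 7;
            }
            out.write(delta);
            previous = doc;
        }
        return out.toByteArray();
    }

    // Decode 'count' doc numbers by summing the VInt-coded gaps.
    static int[] decode(byte[] bytes, int count) {
        int[] docs = new int[count];
        int pos = 0;
        int doc = 0;
        for (int i = 0; i < count; i++) {
            int b = bytes[pos++];
            int delta = b & 0x7F;
            for (int shift = 7; (b & 0x80) != 0; shift += 7) {
                b = bytes[pos++];
                delta |= (b & 0x7F) << shift;
            }
            doc += delta;
            docs[i] = doc;
        }
        return docs;
    }

    public static void main(String[] args) {
        int[] docs = {3, 200, 201};
        byte[] encoded = encode(docs);
        System.out.println(encoded.length + " bytes for " + docs.length + " docs");
        System.out.println(Arrays.toString(decode(encoded, docs.length)));
    }
}
```

Gaps below 128 take a single byte each, so a list of closely spaced
document numbers costs roughly one byte per document.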
 
The getByteSize() method can be used to verify the compression 
once a SortedVIntList is constructed. 
The precise conditions under which this is more memory efficient than 
one bit per document are not easy to specify in advance.
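
As a rough illustration of where the 1/8 figure comes from: a bit set
needs maxDoc/8 bytes regardless of how many documents pass, while every
VInt takes at least one byte, so a delta list can only be smaller when
fewer than maxDoc/8 documents pass, and it needs more than one byte per
document whenever gaps reach 128 or more. The sketch below compares the
two sizes for a given list. All names in it (FilterSizeSketch, vIntLength,
deltaListBytes, bitSetBytes) are hypothetical and are not taken from the
attached files; it counts payload bytes only and ignores object overhead.

```java
// Hypothetical sketch for comparing payload sizes; not the attached code.
class FilterSizeSketch {

    // Number of bytes a non-negative int occupies as a VInt (7 data bits per byte).
    static int vIntLength(int value) {
        int length = 1;
        while ((value & ~0x7F) != 0) {
            value >>>= 7;
            length++;
        }
        return length;
    }

    // Bytes needed to store ascending doc numbers as VInt-coded gaps.
    // Assumption: the first doc number is stored as a gap from 0.
    static long deltaListBytes(int[] sortedDocs) {
        long total = 0;
        int previous = 0;
        for (int doc : sortedDocs) {
            total += vIntLength(doc - previous);
            previous = doc;
        }
        return total;
    }

    // Bytes needed for one bit per document.
    static long bitSetBytes(int maxDoc) {
        return (maxDoc + 7L) / 8;
    }

    // Helper for the demo: every 'step'-th doc number below maxDoc.
    static int[] everyNth(int maxDoc, int step) {
        int[] docs = new int[(maxDoc + step - 1) / step];
        for (int i = 0; i < docs.length; i++) {
            docs[i] = i * step;
        }
        return docs;
    }

    public static void main(String[] args) {
        int maxDoc = 1_000_000;
        System.out.println("bit set:        " + bitSetBytes(maxDoc) + " bytes");
        for (int step : new int[] {4, 10, 1000}) {
            long bytes = deltaListBytes(everyNth(maxDoc, step));
            System.out.println("every " + step + "th doc: " + bytes + " bytes");
        }
    }
}
```

Because the result depends on how the gaps are distributed and not only
on how many documents pass, measuring the size of an actual list remains
the reliable check.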
