[
https://issues.apache.org/jira/browse/LUCENE-5084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13697584#comment-13697584
]
Paul Elschot edited comment on LUCENE-5084 at 7/2/13 7:51 AM:
--------------------------------------------------------------
bq. Have you considered creating a PostingFormat with this? I was thinking in
something like DirectPostingsFormat but instead of using an array of ints for
storing the docIds using an Elias-Fano compressed bit stream.
The Vigna paper is all about posting formats.
Because of this I first implemented an encoder and a decoder in a long format,
and then used these here for a DocIdSet that works on int.
For a postings format, the encoder would need an additional constructor from
index data. That might involve merging the currently separate long arrays for
high bits and low bits into a single array.
was (Author: [email protected]):
bq. Have you considered creating a PostingFormat with this? I was thinking
in something like DirectPostingsFormat but instead of using an array of ints
for storing the docIds using an Elias-Fano compressed bit stream.
> EliasFanoDocIdSet
> -----------------
>
> Key: LUCENE-5084
> URL: https://issues.apache.org/jira/browse/LUCENE-5084
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Paul Elschot
> Assignee: Adrien Grand
> Priority: Minor
> Attachments: LUCENE-5084.patch
>
>
> DocIdSet in Elias-Fano encoding
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]