[jira] [Commented] (LUCENE-5293) Also use EliasFanoDocIdSet in CachingWrapperFilter

Paul Elschot (JIRA) Fri, 18 Oct 2013 00:50:40 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-5293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13798886#comment-13798886
 ]


Paul Elschot commented on LUCENE-5293:
--------------------------------------

bq.  ... the closest thing we have to a cardinality() which is available for 
every DocIdSet.

For a single term used as a filter there is IndexReader.docFreq(Term), and that 
does fit in here.
(Ideally in this case the posting list could be copied from the index into the 
cache, but we're not there yet.)

Shall I add a check for QueryWrapperFilter.getQuery() being a TermQuery and 
then use docFreq() ?

There is also a TermFilter in the queries module. I have not looked at the code 
yet.
Is it necessary to move that to core so a check for that can be used here, too?

bq. ... maybe we could build a WAH8DocIdSet in any case and replace it with an 
EF doc id set when there are not many documents???

For any case in which the cardinality can not easily be determined, that indeed 
would make sense from the benchmark.

bq. I can try to update the benchmarks to add the building of an additional FBS 
before the EF doc id set. 

Adding a single FBS build to the EF DocIdSet can be visualized in the benchmark 
for the build times.
In that case the EF DocIdSet build result never gets above log(1) = 0, so an 
update of the benchmarks would not be needed.



> Also use EliasFanoDocIdSet in CachingWrapperFilter
> --------------------------------------------------
>
>                 Key: LUCENE-5293
>                 URL: https://issues.apache.org/jira/browse/LUCENE-5293
>             Project: Lucene - Core
>          Issue Type: Improvement
>          Components: core/search
>            Reporter: Paul Elschot
>            Priority: Minor
>         Attachments: LUCENE-5293.patch
>
>




--
This message was sent by Atlassian JIRA
(v6.1#6144)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: dev-h...@lucene.apache.org

[jira] [Commented] (LUCENE-5293) Also use EliasFanoDocIdSet in CachingWrapperFilter

Reply via email to