Yannis Hector created SOLR-10445:
------------------------------------
Summary: BloomFilteringPostingsFormat should implement
seekExact(TermState) to avoid to seek within matching segment/field multiple
times
Key: SOLR-10445
URL: https://issues.apache.org/jira/browse/SOLR-10445
Project: Solr
Issue Type: Bug
Security Level: Public (Default Security Level. Issues are Public)
Components: search
Affects Versions: 5.6
Reporter: Yannis Hector
Priority: Minor
BloomFilteringPostingsFormat$BloomFilteredFieldsProducer$BloomFilteredTermsEnum
does not reuse the TermState from the initial lookup when a second seek is
issued (usually when we have a match and we build the Scorer).
Default implementation of TermsEnum#seekExact(BytesRef term, TermState state)
that is used by BF calls the regular seekExact method.
This means that the BloomFilteringPostingsFormat default implementation
performs a second lookup for every segment/field that have a match for the term
(mostly in the various Weight#scorer implementations).
I don't think it is a big issue as we mostly expect search terms to have a
match in a limited number of segment/field. But for few queries it could become
inefficient.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]