Yannis Hector created SOLR-10445:
------------------------------------

             Summary: BloomFilteringPostingsFormat should implement 
seekExact(TermState) to avoid seeking within a matching segment/field multiple 
times
                 Key: SOLR-10445
                 URL: https://issues.apache.org/jira/browse/SOLR-10445
             Project: Solr
          Issue Type: Bug
      Security Level: Public (Default Security Level. Issues are Public)
          Components: search
    Affects Versions: 5.6
            Reporter: Yannis Hector
            Priority: Minor


BloomFilteringPostingsFormat$BloomFilteredFieldsProducer$BloomFilteredTermsEnum 
does not reuse the TermState from the initial lookup when a second seek is 
issued (usually when we have a match and we build the Scorer). 

The default implementation of TermsEnum#seekExact(BytesRef term, TermState state), 
which BF inherits, ignores the state and calls the regular seekExact method. 

This means that BloomFilteringPostingsFormat, by relying on the default 
implementation, performs a second lookup for every segment/field that has a 
match for the term (mostly in the various Weight#scorer implementations).

I don't think it is a big issue, as we mostly expect search terms to have a 
match in a limited number of segments/fields. But for a few queries it could 
become inefficient.
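
A minimal sketch of the behavior described above, using toy stand-ins rather 
than the real Lucene classes. Every name here (ToyTermsEnum, ToyTermState, 
DictionaryTermsEnum, DefaultWrapper, ForwardingWrapper, lookupsFor) is 
hypothetical, and the bloom filter check itself is left out. It only shows how a 
wrapper that inherits the default seekExact(term, state) repeats the dictionary 
lookup, while a wrapper that forwards the call to its delegate does not.

```java
import java.util.Collections;
import java.util.Map;

// Toy model only: these classes are NOT Lucene's TermsEnum/TermState.
public class SeekExactSketch {

    /** Hypothetical stand-in for TermState: remembers where a term was found. */
    static final class ToyTermState {
        final int position;
        ToyTermState(int position) { this.position = position; }
    }

    /** Hypothetical stand-in for TermsEnum. */
    abstract static class ToyTermsEnum {
        abstract boolean seekExact(String term);
        abstract ToyTermState termState();

        // Default behavior as described in the issue: ignore the state
        // and fall back to the regular seekExact lookup.
        void seekExact(String term, ToyTermState state) {
            if (!seekExact(term)) {
                throw new IllegalArgumentException("term=" + term + " does not exist");
            }
        }
    }

    /** Underlying terms dictionary; counts how many real lookups happen. */
    static final class DictionaryTermsEnum extends ToyTermsEnum {
        private final Map<String, Integer> dictionary;
        private int current = -1;
        int lookups = 0;

        DictionaryTermsEnum(Map<String, Integer> dictionary) { this.dictionary = dictionary; }

        @Override boolean seekExact(String term) {
            lookups++;
            Integer position = dictionary.get(term);
            if (position == null) return false;
            current = position;
            return true;
        }

        @Override ToyTermState termState() { return new ToyTermState(current); }

        // Restores the position from the state without a dictionary lookup.
        @Override void seekExact(String term, ToyTermState state) { current = state.position; }
    }

    /** Wrapper that inherits the default seekExact(term, state). */
    static class DefaultWrapper extends ToyTermsEnum {
        final ToyTermsEnum delegate;
        DefaultWrapper(ToyTermsEnum delegate) { this.delegate = delegate; }
        @Override boolean seekExact(String term) { return delegate.seekExact(term); }
        @Override ToyTermState termState() { return delegate.termState(); }
    }

    /** Wrapper that forwards the state to the delegate, as the issue proposes. */
    static final class ForwardingWrapper extends DefaultWrapper {
        ForwardingWrapper(ToyTermsEnum delegate) { super(delegate); }
        @Override void seekExact(String term, ToyTermState state) { delegate.seekExact(term, state); }
    }

    /** Runs "initial lookup, then seek with state" and returns the lookup count. */
    static int lookupsFor(boolean forward) {
        DictionaryTermsEnum inner = new DictionaryTermsEnum(Collections.singletonMap("solr", 7));
        ToyTermsEnum wrapper = forward ? new ForwardingWrapper(inner) : new DefaultWrapper(inner);
        if (!wrapper.seekExact("solr")) throw new IllegalStateException("term not found");
        ToyTermState state = wrapper.termState();   // state captured on the first match
        wrapper.seekExact("solr", state);           // second seek, e.g. when building a scorer
        return inner.lookups;
    }

    public static void main(String[] args) {
        System.out.println("default wrapper lookups:    " + lookupsFor(false));
        System.out.println("forwarding wrapper lookups: " + lookupsFor(true));
    }
}
```

In this toy model the cost of the extra lookup is a single map access. In a 
real postings format it would be a full term dictionary seek per matching 
segment/field.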



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
