[jira] [Commented] (LUCENE-8311) Leverage impacts for phrase queries

Robert Muir (JIRA) Tue, 22 May 2018 03:30:30 -0700

    [ 
https://issues.apache.org/jira/browse/LUCENE-8311?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16483764#comment-16483764
 ]


Robert Muir commented on LUCENE-8311:
-------------------------------------

I wonder if it's difficult to test with another similarity, such as a DFR model? 
I'm only asking because I'm a little concerned that the bogus way we compute 
"phrase IDF" for BM25Similarity & ClassicSimilarity is getting in your way. 

All the other models use a saner approach (they score a phrase like a 
disjunction internally). BM25 carried along the brain damage of 
ClassicSimilarity just because it was trying to minimize differences, not for 
any particularly good reason.
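
To make the "phrase IDF" point concrete, here is a minimal sketch in Python rather than Lucene code. It assumes the BM25 IDF formula ln(1 + (N - n + 0.5) / (n + 0.5)) and assumes that the phrase IDF is simply the sum of the per-term IDFs; the function names `bm25_idf` and `phrase_idf` are made up for illustration and are not Lucene APIs.

```python
import math
from typing import Iterable


def bm25_idf(doc_count: int, doc_freq: int) -> float:
    """BM25-style IDF for one term: ln(1 + (N - n + 0.5) / (n + 0.5))."""
    return math.log(1.0 + (doc_count - doc_freq + 0.5) / (doc_freq + 0.5))


def phrase_idf(doc_count: int, term_doc_freqs: Iterable[int]) -> float:
    """Assumed "phrase IDF": the sum of the IDFs of the phrase's terms.

    Only per-term statistics are used; how many documents actually
    contain the phrase never enters the computation.
    """
    return sum(bm25_idf(doc_count, df) for df in term_doc_freqs)


if __name__ == "__main__":
    # Hypothetical index of 100 documents and a two-term phrase.
    print(phrase_idf(100, [10, 50]))
```

The single summed value is then applied to the phrase frequency as if the phrase were one term, which is the behavior that a DFR model would sidestep by scoring each term with its own statistics.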

> Leverage impacts for phrase queries
> -----------------------------------
>
>                 Key: LUCENE-8311
>                 URL: https://issues.apache.org/jira/browse/LUCENE-8311
>             Project: Lucene - Core
>          Issue Type: Improvement
>            Reporter: Adrien Grand
>            Priority: Minor
>         Attachments: LUCENE-8311.patch
>
>
> Now that we expose raw impacts, we could leverage them for phrase queries.
> For instance for exact phrases, we could take the minimum term frequency for 
> each unique norm value in order to get upper bounds of the score for the 
> phrase.
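
The idea in the description above can be sketched roughly as follows. This is an illustration in Python, not the attached patch. It assumes each term exposes its impacts as (freq, norm) pairs, that a larger norm means a longer document, and that a document with a given norm can have at most the largest freq among that term's impacts with an equal or smaller norm. The names `max_freq_for_norm` and `phrase_impacts` are hypothetical.

```python
from typing import List, Tuple

Impact = Tuple[int, int]  # (freq, norm)


def max_freq_for_norm(impacts: List[Impact], norm: int) -> int:
    """Largest freq among impacts whose norm is <= the given norm, or 0."""
    return max((f for f, n in impacts if n <= norm), default=0)


def phrase_impacts(per_term_impacts: List[List[Impact]]) -> List[Impact]:
    """Upper-bound impacts for an exact phrase.

    The phrase frequency in a document cannot exceed the frequency of
    any of its terms, so for each unique norm value we take the minimum
    of the per-term frequency bounds.
    """
    norms = sorted({n for impacts in per_term_impacts for _, n in impacts})
    merged: List[Impact] = []
    for norm in norms:
        freq = min(max_freq_for_norm(impacts, norm) for impacts in per_term_impacts)
        # Skip entries that add nothing: a larger norm with no higher
        # freq is already covered by the previous entry.
        if freq > 0 and (not merged or freq > merged[-1][0]):
            merged.append((freq, norm))
    return merged


if __name__ == "__main__":
    term_a = [(3, 10), (5, 20)]
    term_b = [(2, 10), (4, 15), (7, 30)]
    print(phrase_impacts([term_a, term_b]))
```

Feeding each merged (freq, norm) pair to the similarity and taking the maximum would then give an upper bound on the phrase score for that block of documents.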




