Alessandro Benedetti created LUCENE-8347:
--------------------------------------------

             Summary: BlendedInfixSuggester to handle multi term matches better
                 Key: LUCENE-8347
                 URL: https://issues.apache.org/jira/browse/LUCENE-8347
             Project: Lucene - Core
          Issue Type: Improvement
          Components: core/search
            Reporter: Alessandro Benedetti


Currently the blendedInfix suggester considers just the first match position 
when scoring a suggestion.
>From the lucene-dev mailing list :
"
If I write more than one term in the query, let's say 
 
"Mini Bar Fridge" 
 
I would expect in the results something like (note that allTermsRequired=true 
and the schema weight field always returns 1000)
 
- *Mini Bar Fridge* something
- *Mini Bar Fridge* something else
- *Mini Bar* something *Fridge*        
- *Mini Bar* something else *Fridge*
- *Mini* something *Bar Fridge*
...
 
Instead I see this: 
 
- *Mini Bar* something *Fridge*        
- *Mini Bar* something else *Fridge*
- *Mini Bar Fridge* something
- *Mini Bar Fridge* something else
- *Mini* something *Bar Fridge*
...
 
After having a look at the suggester code 
(BlendedInfixSuggester.createCoefficient), I see that the component takes in 
account only one position, which is the lowest position (among the three 
matching terms) within the term vector ("mini" in the example above) so all the 
suggestions above have the same weight 
"
Scope of this Jira issue is to improve the BlendedInfix to better manage those 
scenarios.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to