Alessandro Benedetti created LUCENE-8347:
--------------------------------------------
Summary: BlendedInfixSuggester to handle multi term matches better
Key: LUCENE-8347
URL: https://issues.apache.org/jira/browse/LUCENE-8347
Project: Lucene - Core
Issue Type: Improvement
Components: core/search
Reporter: Alessandro Benedetti
Currently the blendedInfix suggester considers just the first match position
when scoring a suggestion.
>From the lucene-dev mailing list :
"
If I write more than one term in the query, let's say
"Mini Bar Fridge"
I would expect in the results something like (note that allTermsRequired=true
and the schema weight field always returns 1000)
- *Mini Bar Fridge* something
- *Mini Bar Fridge* something else
- *Mini Bar* something *Fridge*
- *Mini Bar* something else *Fridge*
- *Mini* something *Bar Fridge*
...
Instead I see this:
- *Mini Bar* something *Fridge*
- *Mini Bar* something else *Fridge*
- *Mini Bar Fridge* something
- *Mini Bar Fridge* something else
- *Mini* something *Bar Fridge*
...
After having a look at the suggester code
(BlendedInfixSuggester.createCoefficient), I see that the component takes in
account only one position, which is the lowest position (among the three
matching terms) within the term vector ("mini" in the example above) so all the
suggestions above have the same weight
"
Scope of this Jira issue is to improve the BlendedInfix to better manage those
scenarios.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]