[ https://issues.apache.org/jira/browse/LUCENE-8216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16694447#comment-16694447 ]
ASF subversion and git services commented on LUCENE-8216:
---------------------------------------------------------
Commit fd96bc5ca6b1cf0c24953fb7b35937e403846440 in lucene-solr's branch
refs/heads/master from [~jim.ferenczi]
[ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=fd96bc5 ]
LUCENE-8216: Added a new BM25FQuery in sandbox to blend statistics across
several fields using the BM25F formula
> Better cross-field scoring
> --------------------------
>
> Key: LUCENE-8216
> URL: https://issues.apache.org/jira/browse/LUCENE-8216
> Project: Lucene - Core
> Issue Type: Improvement
> Reporter: Adrien Grand
> Priority: Major
> Fix For: master (8.0)
>
> Attachments: LUCENE-8216.patch, LUCENE-8216.patch
>
>
> I'd like Lucene to have better support for scoring across multiple fields.
> Today we have BlendedTermQuery, which tries to help there, but it probably
> does too much in some respects (handling both cross-field term queries and
> synonyms) and too little in others (it tries to merge index-level
> statistics, but not per-document statistics such as tf and norm).
> Maybe we could implement something like BM25F so that queries across multiple
> fields retain the benefits of BM25, such as the impact of the term frequency
> saturating quickly, which is not the case with BlendedTermQuery when a term
> occurs across many fields.
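
To illustrate the saturation point made in the description, here is a
minimal sketch in plain Java. It is not the BM25FQuery code from the commit
and it uses no Lucene APIs: the class and method names are hypothetical, IDF
and length normalization are left out, and k1 = 1.2 is an assumed value. It
compares saturating each field separately and summing the results with
combining the weighted term frequencies first and saturating once, which is
the general idea behind BM25F.

```java
// Simplified illustration only: IDF and length normalization are omitted,
// and every name here is hypothetical, not a Lucene API.
class Bm25fSaturationSketch {
    static final double K1 = 1.2; // assumed value for the saturation parameter

    // BM25 term-frequency saturation with length normalization left out.
    static double saturate(double tf) {
        return tf / (tf + K1);
    }

    // Per-field scoring: saturate each field separately, then add the results.
    static double sumOfPerFieldScores(double[] tfs, double[] weights) {
        double sum = 0;
        for (int i = 0; i < tfs.length; i++) {
            sum += weights[i] * saturate(tfs[i]);
        }
        return sum;
    }

    // BM25F-style scoring: combine weighted frequencies first, saturate once.
    static double combinedFieldScore(double[] tfs, double[] weights) {
        double combinedTf = 0;
        for (int i = 0; i < tfs.length; i++) {
            combinedTf += weights[i] * tfs[i];
        }
        return saturate(combinedTf);
    }

    public static void main(String[] args) {
        double[] tfs = {1, 1, 1, 1};     // one occurrence in each of four fields
        double[] weights = {1, 1, 1, 1}; // equal field weights
        System.out.println("sum of per-field scores: " + sumOfPerFieldScores(tfs, weights));
        System.out.println("combined-field score:    " + combinedFieldScore(tfs, weights));
    }
}
```

With equal weights, the summed variant grows with every additional field
that contains the term, while the combined variant stays below 1 however
many fields match, so the saturation behaviour of BM25 is preserved.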