[
https://issues.apache.org/jira/browse/LUCENE-7498?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15976654#comment-15976654
]
Alessandro Benedetti commented on LUCENE-7498:
----------------------------------------------
Hi [~rcmuir], thank you for your feedback. I agree with your points; let's
identify the next steps:
1) I will split up the patch:
- MLT internals broken up into more maintainable classes, as backward
compatible as possible (the scoring refactor makes it easy to introduce new
scorers in the future)
- introduction of a BM25 term scorer (possibly requiring BM25 to become public,
or some partially duplicated code) -> this will remain in this JIRA
- changes to SolrCloud classes, the classification module, etc. (possibly
needed because of API changes)
2) create the related JIRAs
3) proceed with the review in order to reduce complexity (the first review will
be big, but much smaller and more cohesive than this one)
I will keep you updated.
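
To make the scoring change in step 1 concrete, here is a minimal sketch of how a TF/IDF-style term weight and a BM25 term weight differ when ranking candidate terms. It is not the code from the patch and does not use Lucene's API: the class name TermScorerSketch, the method names, and the parameter names are all hypothetical, and the formulas are the textbook forms with the commonly used defaults k1 = 1.2 and b = 0.75.

```java
/** Hypothetical illustration only; not Lucene's MoreLikeThis or BM25Similarity API. */
public final class TermScorerSketch {
    // BM25 free parameters (assumed values: the commonly used defaults).
    static final double K1 = 1.2;
    static final double B = 0.75;

    private TermScorerSketch() {}

    /** TF/IDF-style weight: raw term frequency times a smoothed idf. Grows linearly with tf. */
    static double tfIdf(int termFreq, long docFreq, long docCount) {
        double idf = 1.0 + Math.log((docCount + 1.0) / (docFreq + 1.0));
        return termFreq * idf;
    }

    /** Textbook BM25 weight: tf saturates and is normalized by document length. */
    static double bm25(int termFreq, long docFreq, long docCount,
                       double docLen, double avgDocLen) {
        double idf = Math.log(1.0 + (docCount - docFreq + 0.5) / (docFreq + 0.5));
        double lengthNorm = K1 * (1.0 - B + B * docLen / avgDocLen);
        return idf * (termFreq * (K1 + 1.0)) / (termFreq + lengthNorm);
    }
}
```

With TF/IDF, doubling the term frequency doubles the weight; with BM25 the gain shrinks as the frequency grows, and terms from longer-than-average documents are weighted lower. Keeping the two formulas behind one scorer abstraction is one way the refactor could allow further scorers to be added later.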
> More Like This to Use BM25
> --------------------------
>
> Key: LUCENE-7498
> URL: https://issues.apache.org/jira/browse/LUCENE-7498
> Project: Lucene - Core
> Issue Type: Improvement
> Components: modules/other
> Reporter: Alessandro Benedetti
> Assignee: Tommaso Teofili
>
> BM25 is now the default similarity, but More Like This is still using the
> old TF/IDF.
>
> This issue is to move to BM25 and refactor the MLT to be more organised,
> extensible and maintainable.
> A few extensions will follow later, but the focus of this issue will be:
> - BM25
> - code refactor + tests