[ https://issues.apache.org/jira/browse/LUCENE-7823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16074623#comment-16074623 ]
ASF subversion and git services commented on LUCENE-7823: --------------------------------------------------------- Commit 8ccb61c0af3c38dab6f1a62eafb836fb6415e55c in lucene-solr's branch refs/heads/master from [~teofili] [ https://git-wip-us.apache.org/repos/asf?p=lucene-solr.git;h=8ccb61c ] LUCENE-7823, LUCENE-7838 - added missing entires in changes.txt > Have a naive bayes classifier which uses plain BM25 scores instead of plain > frequencies > --------------------------------------------------------------------------------------- > > Key: LUCENE-7823 > URL: https://issues.apache.org/jira/browse/LUCENE-7823 > Project: Lucene - Core > Issue Type: Improvement > Components: modules/classification > Reporter: Tommaso Teofili > Assignee: Tommaso Teofili > Fix For: 7.0 > > > {{SimpleNaiveBayesClassifier}} users term frequencies with add one smoothing > to calculate likelihood and just tf for prior. Given Lucene has switched to > BM25 it would be better to have a different impl which uses BM25 > scoring as a probability measure of both prior and likelihood. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: dev-unsubscr...@lucene.apache.org For additional commands, e-mail: dev-h...@lucene.apache.org