[ 
https://issues.apache.org/jira/browse/LUCENE-4782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13582237#comment-13582237
 ] 

Robert Muir commented on LUCENE-4782:
-------------------------------------

I'm not sure we have to realistically worry about this too much.

It only applies to 3.x indexes: in general all current codecs support this 
statistic.

So another option is to simply add SuppressCodecs("Lucene3x") annotation to the 
classification module and document that you should run IndexUpgrader on any old 
3.x segments you have lying around.

                
> Let the NaiveBayes classifier have a fallback docCount method if codec 
> doesn't support Terms#docCount()
> -------------------------------------------------------------------------------------------------------
>
>                 Key: LUCENE-4782
>                 URL: https://issues.apache.org/jira/browse/LUCENE-4782
>             Project: Lucene - Core
>          Issue Type: Bug
>          Components: modules/classification
>            Reporter: Tommaso Teofili
>            Assignee: Tommaso Teofili
>             Fix For: 4.2, 5.0
>
>
> In _SimpleNaiveBayesClassifier_ _docsWithClassSize_ variable is initialized 
> to _MultiFields.getTerms(this.atomicReader, 
> this.classFieldName).getDocCount()_ which may be -1 if the codec doesn't 
> support doc counts, therefore there should be an alternative way to 
> initialize such a variable with the documents count.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to