mansur created MAHOUT-1391:
------------------------------

             Summary: Possibility to disable confusion matrix in naive bayes
                 Key: MAHOUT-1391
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1391
             Project: Mahout
          Issue Type: New Feature
          Components: Classification
    Affects Versions: 1.0
            Reporter: mansur


Sometimes confusion matrix is to big and not really necessary.

And there is another case for the possibility:
If you split a dataset with many labels with random selection percent to 
testdataset and trainingdataset, it could happen, that there are classes/labels 
in testdata, which do not appear in the trainingdataset. By creating a model 
with the trainingdata the created labelindex does not include some labels from 
testdata. Therefore if you test on this model with the testdata, mahout tries 
to create a confusion matrix with the labels from testdata which are not 
included in the labelindex and throws an exception.



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)

Reply via email to