[ 
https://issues.apache.org/jira/browse/MAHOUT-605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12994048#comment-12994048
 ] 

Ted Dunning commented on MAHOUT-605:
------------------------------------

Robin S.

Remember that CNB computes a score indicating whether the document belongs to 
the complement of a class, whereas the NB score indicates whether the document 
belongs to the class itself.  Thus, the scoring code may be the same, but the 
weights are estimated from different data.

As Robin A. mentioned, for two classes CNB and NB are equivalent, although 
the ordering of the scores is reversed.
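
To make the two-class case concrete, here is a minimal sketch. It is not 
Mahout's implementation: it assumes plain Laplace-smoothed multinomial 
weights, leaves out class priors and any weight normalization, and the class 
name ComplementSketch, the method names, and the toy counts are all made up 
for illustration. The scoring method is shared; only the counts that feed the 
weights differ.

```java
import java.util.Arrays;

// Hypothetical toy example; not taken from the Mahout code base.
class ComplementSketch {

    // counts[c][t]: count of term t in the training documents of class c.
    // Returns Laplace-smoothed log P(t | c) for every class and term.
    public static double[][] smoothedLogWeights(double[][] counts) {
        int classes = counts.length;
        int terms = counts[0].length;
        double[][] weights = new double[classes][terms];
        for (int c = 0; c < classes; c++) {
            double total = Arrays.stream(counts[c]).sum();
            for (int t = 0; t < terms; t++) {
                weights[c][t] = Math.log((counts[c][t] + 1.0) / (total + terms));
            }
        }
        return weights;
    }

    // For each class c, sums the term counts of every class except c.
    public static double[][] complementCounts(double[][] counts) {
        int classes = counts.length;
        int terms = counts[0].length;
        double[][] complement = new double[classes][terms];
        for (int c = 0; c < classes; c++) {
            for (int other = 0; other < classes; other++) {
                if (other == c) {
                    continue;
                }
                for (int t = 0; t < terms; t++) {
                    complement[c][t] += counts[other][t];
                }
            }
        }
        return complement;
    }

    // NB weights come from the class's own counts.
    public static double[][] nbWeights(double[][] counts) {
        return smoothedLogWeights(counts);
    }

    // CNB weights come from the counts of all the other classes.
    public static double[][] complementWeights(double[][] counts) {
        return smoothedLogWeights(complementCounts(counts));
    }

    // Shared scoring code: sum over terms of frequency times weight.
    public static double[] scores(double[][] weights, double[] doc) {
        double[] result = new double[weights.length];
        for (int c = 0; c < weights.length; c++) {
            for (int t = 0; t < doc.length; t++) {
                result[c] += doc[t] * weights[c][t];
            }
        }
        return result;
    }

    public static void main(String[] args) {
        double[][] counts = {{8, 1, 1}, {1, 1, 8}}; // two classes, three terms
        double[] doc = {3, 0, 1};                   // term frequencies of one document
        System.out.println("NB:  " + Arrays.toString(scores(nbWeights(counts), doc)));
        System.out.println("CNB: " + Arrays.toString(scores(complementWeights(counts), doc)));
    }
}
```

With only two classes, the complement of one class is exactly the other, so 
in this sketch the CNB score for class 0 equals the NB score for class 1 and 
vice versa. Under NB the largest score marks the best class; under CNB the 
largest score marks the class the document is least likely to belong to.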


> Array returned by classifier.bayes.algorithm.CBayesAlgorithm.classifyDocument 
> is sorted ascendant
> -------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-605
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-605
>             Project: Mahout
>          Issue Type: Bug
>          Components: Classification
>    Affects Versions: 0.4
>         Environment: Linux
>            Reporter: Robin Swezey
>            Assignee: Robin Anil
>            Priority: Minor
>              Labels: bayesian, classification
>             Fix For: 0.5
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> The array returned for an n-best call to classifyDocument is sorted in 
> ascending order instead of descending order. 
> Ex:
> {quote}
> 47-best: [ClassifierResult{category='香川県', score=32.28281232047167},
> ClassifierResult{category='宮崎県', score=32.28969992600906}, ......,
> ClassifierResult{category='愛知県', score=32.487981016587796},
> ClassifierResult{category='東京都', score=32.49189358054859},
> ClassifierResult{category='北海道', score=32.49811200756193}]
> {quote}
> (classification of documents for Japanese prefectures)
> Inside the classifyDocument method, just before the return statement we found 
> this line:
> {quote}
> Collections.reverse(result);
> {quote}
> Is this a mistake or a design choice? (we are not sure, hence the "Minor" 
> priority)

-- 
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

