Hi Suneel, I'm using 0.9. I did not train using Complementary NB, but was only using it for testing. I'm not real familiar with the math but can see CNBClassifier is scoring differently than SNBClassifier, so I thought I would see something, but the scores and results from testnb didn't change.
I'll get the trunk version and see if it fixes that part at least. ________________________________________ From: Suneel Marthi <[email protected]> Sent: Thursday, March 27, 2014 3:07 PM To: [email protected] Subject: Re: MAHOUT-1369 - Why does theta normalization for naive bayes classification commented out? Which Mahout version r u running? While its true that ThetaNormalizer is still disabled today, Mahout-1389 fixes a bug wherein Complementary NB wasn't being called when invoked. Please test with Mahout 0.9 or trunk. On Thursday, March 27, 2014 3:53 PM, Chandler Burgess <[email protected]> wrote: Hello all, It seems Robin Anil hasn't responded, and no one is sure of the status on this. What needs to be done on this, and/or what can I do to help? I'm no ML expert, but I do have the paper and should be able to verify/fix the implementation. I'm REALLY interested in using the CNB classifier, since it seems well suited to the problem I'm trying to tackle, before I give up and use something else. I've done tests and see no difference when -c is passed on the command line for training or testing. I also wrote a program to print the scores using StandardNaiveBayesClassifier and ComplementaryNaiveBayesClassifier in a binary classification problem and see no difference between the scores, so it seems complementary naïve bayes is completely disabled. Thanks, Chandler Burgess
