Ok, I'll uncomment those lines and see. I also have plenty of test data 
available  too (I'm doing document classification with unbalanced classes), so 
I'll see if it improves there as well.

Also, I'll try to make some time in the next week and go over the algorithm in 
detail compared with the paper as an extra check.

Thanks,
Chandler
________________________________________
From: Sebastian Schelter <[email protected]>
Sent: Thursday, March 27, 2014 4:01 PM
To: [email protected]
Subject: Re: MAHOUT-1369 - Why does theta normalization for naive bayes 
classification commented out?

Hi Chandler,

I think a good way to go would be to reenable theta normalization and
run the classification examples that we already have to see how it
affects the result (and make sure it improves the result).

Would be great to have this fixed. I'm also planning to port NB to our
Spark DSL very soon (should be just a few lines of code).

--sebastian


On 03/27/2014 09:07 PM, Suneel Marthi wrote:
> Which Mahout version r u running? While its true that ThetaNormalizer is 
> still disabled today, Mahout-1389 fixes a bug wherein Complementary NB wasn't 
> being called when invoked.
>
> Please test with Mahout 0.9 or trunk.
>
>
>
>
> On Thursday, March 27, 2014 3:53 PM, Chandler Burgess 
> <[email protected]> wrote:
>
> Hello all,
>
> It seems Robin Anil hasn't responded, and no one is sure of the status on 
> this. What needs to be done on this, and/or what can I do to help? I'm no ML 
> expert, but I do have the paper and should be able to verify/fix the 
> implementation. I'm REALLY interested in using the CNB classifier, since it 
> seems well suited to the problem I'm trying to tackle, before I give up and 
> use something else.
>
> I've done tests and see no difference when -c is passed on the command line 
> for training or testing. I also wrote a program to print the scores using 
> StandardNaiveBayesClassifier and ComplementaryNaiveBayesClassifier in a 
> binary classification problem and see no difference between the scores, so it 
> seems complementary naïve bayes is completely disabled.
>
> Thanks,
> Chandler Burgess
>

Reply via email to