On 27.03.2012 Dimitri Goldin wrote:
> Having tried Mallets naive bayes implementation we achieved ~95%
> accuracy without having to balance the training-data. Does anybody know
> which implementation detail might cause this or why balance seems
> influence mahouts implementation much more?

Without knowing the Mallet implementation: You describe that you tried using 
two 
tokenizations for your Mahout runs - what are you using when running Mallet?

Which Naive Bayes implementation in Mahout did you use?

Did you also try running with the complementary naive bayes implementation or 
the logistic regression instead?


Isabel

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to