On 27.03.2012 Dimitri Goldin wrote: > Having tried Mallets naive bayes implementation we achieved ~95% > accuracy without having to balance the training-data. Does anybody know > which implementation detail might cause this or why balance seems > influence mahouts implementation much more?
Without knowing the Mallet implementation: You describe that you tried using two tokenizations for your Mahout runs - what are you using when running Mallet? Which Naive Bayes implementation in Mahout did you use? Did you also try running with the complementary naive bayes implementation or the logistic regression instead? Isabel
signature.asc
Description: This is a digitally signed message part.
