I think you will have problems there. The problem is that the iris data set has 5 or so continuous variables and naive bayes really only likes sparse binary features.
There is a patch at https://issues.apache.org/jira/browse/MAHOUT-228 that gives you the beginnings of an online logistic regression classifier (you will need to be ambitious to use that). The random forest implementation already in mahout can handle continuous variables as well. Neither of these other two implementations is nearly as polished (yet) as the Naive Bayes stuff. On Tue, Jun 8, 2010 at 1:55 AM, Robin Anil <[email protected]> wrote: > I am trying to use mahout's bayesian classifier over the iris dataset. >
