On Jul 31, 2008, at 3:09 PM, Robin Anil wrote:

Hi Grant,
I have modified the BayesFileFormatter to use \t as the
separator between the label and the collapsed document (as the default
TextInputFormat uses that). This change also affects the Bayes
Classifier Trainer as well as the CBayes Classifier Feature Driver. Should I
go ahead and put it in the patch?

+1
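
For reference, the line layout being discussed (label, a single tab, then the collapsed document) could be sketched roughly as follows. This is only an illustration under assumptions: the class LabeledLine and its format/parse methods are hypothetical names, not the actual BayesFileFormatter code, and "collapsed" is assumed to mean that runs of whitespace are folded into single spaces.

```java
import java.util.AbstractMap.SimpleEntry;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Mahout.
public class LabeledLine {

    // Build one "label<TAB>document" line. Whitespace runs in the document
    // (including tabs and newlines) are folded into single spaces, so the
    // only tab left in the line is the separator (assuming the label
    // itself contains no tab).
    public static String format(String label, String document) {
        String collapsed = document.trim().replaceAll("\\s+", " ");
        return label + "\t" + collapsed;
    }

    // Split a line at the first tab into (label, document).
    public static Map.Entry<String, String> parse(String line) {
        int tab = line.indexOf('\t');
        if (tab < 0) {
            throw new IllegalArgumentException("No tab separator in line: " + line);
        }
        return new SimpleEntry<>(line.substring(0, tab), line.substring(tab + 1));
    }

    public static void main(String[] args) {
        String line = format("sports", "the  match\twas\nclose");
        Map.Entry<String, String> entry = parse(line);
        // prints: sports -> the match was close
        System.out.println(entry.getKey() + " -> " + entry.getValue());
    }
}
```

Splitting at the first tab only means the trainer and feature driver would need nothing beyond the separator position to recover the label.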




Right now I have made sure that the output of the Trainer creates some
values, which are then used in the final CBayes equation calculated in the
getWeight(feature, label) function. Turning it into a Bayes classifier with
all the weight and length normalization, but without the CBayes complexity,
needs only a few code changes in the Model.

Should I go ahead and change the Bayes classifier in the next patch?

+1 Do what you need to do. What do you think about moving towards a matrix model, though, and using the proposed Matrix labels?

Perhaps we should just create a new issue that combines these two solutions and we point MAHOUT-9 and 60 to that one...

-Grant

