Refactoring of the parallel Naive Bayes implementation in
org.apache.mahout.classifier.naivebayes
-------------------------------------------------------------------------------------------------
Key: MAHOUT-746
URL: https://issues.apache.org/jira/browse/MAHOUT-746
Project: Mahout
Issue Type: Improvement
Components: Classification
Affects Versions: 0.6
Reporter: Sebastian Schelter
Assignee: Sebastian Schelter
I refactored the code in org.apache.mahout.classifier.naivebayes so that its
jobs extend AbstractJob, decoupled the model serialization from the job output,
extracted trainer classes, and tried to clarify naming and reduce code
complexity. I also added tests for the MapReduce training code as well as a toy
integration test.
It would be great if someone could review my patch to make sure I didn't break
anything.
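For reviewers, here is a rough sketch of what "decoupling model serialization
from the job output" can look like in isolation. It is not taken from the
patch: ToyModel, ToyTrainer, and all of their methods are hypothetical names,
and it uses only the JDK rather than Mahout or Hadoop classes. The point is
just that the model owns its byte format, while the trainer only produces a
model object and knows nothing about how it is stored.

```java
import java.nio.ByteBuffer;
import java.util.Arrays;

// Hypothetical stand-in for a trained model; NOT a Mahout class.
final class ToyModel {
    private final double[] labelWeights;

    ToyModel(double[] labelWeights) {
        this.labelWeights = labelWeights.clone();
    }

    double weight(int label) {
        return labelWeights[label];
    }

    // Serialization lives with the model, independent of any training job.
    byte[] toBytes() {
        ByteBuffer buffer =
            ByteBuffer.allocate(Integer.BYTES + Double.BYTES * labelWeights.length);
        buffer.putInt(labelWeights.length);
        for (double w : labelWeights) {
            buffer.putDouble(w);
        }
        return buffer.array();
    }

    // Rebuilds a model from the byte layout written by toBytes().
    static ToyModel fromBytes(byte[] bytes) {
        ByteBuffer buffer = ByteBuffer.wrap(bytes);
        int n = buffer.getInt();
        double[] weights = new double[n];
        for (int i = 0; i < n; i++) {
            weights[i] = buffer.getDouble();
        }
        return new ToyModel(weights);
    }

    boolean sameWeights(ToyModel other) {
        return Arrays.equals(labelWeights, other.labelWeights);
    }
}

// Hypothetical trainer: it only builds a model and never touches storage.
final class ToyTrainer {
    // Counts how often each label index occurs; a placeholder for real training.
    static ToyModel train(int[] labels, int numLabels) {
        double[] counts = new double[numLabels];
        for (int label : labels) {
            counts[label] += 1.0;
        }
        return new ToyModel(counts);
    }
}
```

With this split, a round trip such as
ToyModel.fromBytes(ToyTrainer.train(labels, 3).toBytes()) can be tested
without running any job, which is the kind of check a toy integration test
might perform.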