Refactoring of the parallel Naive Bayes implementation in 
org.apache.mahout.classifier.naivebayes
-------------------------------------------------------------------------------------------------

                 Key: MAHOUT-746
                 URL: https://issues.apache.org/jira/browse/MAHOUT-746
             Project: Mahout
          Issue Type: Improvement
          Components: Classification
    Affects Versions: 0.6
            Reporter: Sebastian Schelter
            Assignee: Sebastian Schelter


I refactored the code in org.apache.mahout.classifier.naivebayes to extend 
AbstractJob, decoupled the model serialization from the job output, extracted 
trainer classes and tried to clarify naming and reduce code complexity. I also 
added tests for the training M/R code as well as a toy integration test.

It would be great if someone could review my patch to make sure I didn't break 
anything.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to