Universal input file format for classifier algorithms in Mahout
---------------------------------------------------------------

                 Key: MAHOUT-785
                 URL: https://issues.apache.org/jira/browse/MAHOUT-785
             Project: Mahout
          Issue Type: Improvement
          Components: Classification
    Affects Versions: 0.6
            Reporter: XiaoboGu


I think a universal input file format would be much more convenient for users, 
especially command-line users, and we should also consider adding a common set 
of command-line options to the classification algorithms, such as options for 
target/predictor variables and their types. Users could then prepare their data 
once and build different models to find the best one. Currently we should 
consider the following:
1. SGD LogisticRegression
2. NaiveBayes
3. Bayes
4. Random Forest
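
To make the idea of shared options concrete, here is a minimal sketch of how one
set of target/predictor/type options might be parsed once and handed to any of
the trainers above. It is only an illustration: the class name
CommonClassifierOptions, the option names --target, --predictors and --types,
and the rule that the last type is repeated for remaining predictors are all
assumptions made for this example, not an existing or agreed Mahout interface.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical holder for options shared by all classifier trainers.
// Nothing here is part of Mahout; names and flags are illustrative only.
class CommonClassifierOptions {
    final String target;                      // name of the target variable
    final Map<String, String> predictorTypes; // predictor name -> declared type

    CommonClassifierOptions(String target, Map<String, String> predictorTypes) {
        this.target = target;
        this.predictorTypes = predictorTypes;
    }

    // Parses: --target NAME --predictors P1 P2 ... --types T1 T2 ...
    static CommonClassifierOptions parse(String[] args) {
        String target = null;
        List<String> predictors = new ArrayList<>();
        List<String> types = new ArrayList<>();
        List<String> current = null; // list currently collecting values

        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            if (arg.equals("--target")) {
                if (i + 1 >= args.length) {
                    throw new IllegalArgumentException("--target needs a value");
                }
                target = args[++i];
                current = null;
            } else if (arg.equals("--predictors")) {
                current = predictors;
            } else if (arg.equals("--types")) {
                current = types;
            } else if (current != null) {
                current.add(arg);
            } else {
                throw new IllegalArgumentException("Unexpected argument: " + arg);
            }
        }
        if (target == null || predictors.isEmpty() || types.isEmpty()) {
            throw new IllegalArgumentException(
                "--target, --predictors and --types are all required");
        }

        // Assumed rule: if fewer types than predictors are given,
        // the last type applies to all remaining predictors.
        Map<String, String> predictorTypes = new LinkedHashMap<>();
        for (int i = 0; i < predictors.size(); i++) {
            String type = types.get(Math.min(i, types.size() - 1));
            predictorTypes.put(predictors.get(i), type);
        }
        return new CommonClassifierOptions(target, predictorTypes);
    }

    public static void main(String[] args) {
        CommonClassifierOptions opts = parse(new String[] {
            "--target", "color", "--predictors", "x", "y", "--types", "numeric"});
        System.out.println(opts.target + " <- " + opts.predictorTypes);
    }
}
```

With something along these lines, each algorithm's command-line driver would
read the same parsed options, so a user could switch algorithms without
re-describing the data.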

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        
