[ 
https://issues.apache.org/jira/browse/MAHOUT-286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sean Owen updated MAHOUT-286:
-----------------------------

           Issue Type: Improvement  (was: Bug)
        Fix Version/s: 0.5
    Affects Version/s: 0.3
             Priority: Minor  (was: Major)

I'll mark this for 0.5 but not seeing any movement on this, so may just get 
closed out

> Need to be able to run classifiers from non-text input (such as ARFF data)
> --------------------------------------------------------------------------
>
>                 Key: MAHOUT-286
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-286
>             Project: Mahout
>          Issue Type: Improvement
>    Affects Versions: 0.3
>            Reporter: Ted Dunning
>            Priority: Minor
>             Fix For: 0.5
>
>         Attachments: data.arff, data.training.arff, mahout.log, run.sh, 
> weka.log
>
>
> Martin Haeger wrote this:
> {quote}
> We're experimenting a bit with Weka and Mahout. Our input data is a
> relation in ARFF format (see attached data.training.arff), and we'd
> like to classify it using Mahout. However, it seems (to us, at first)
> that the Mahout classifier.bayes.interfaces.Algorithm interface is
> centered around documents of text, and not general attribute data.
> Thus, running the classifier causes our ARFF data to be interpreted as
> a document of words, with not very useful results (see attached
> mahout.log).
> With Weka, we're able to get the results we want (see attached weka.log).
> Any suggestions for how to get this working?
> {quote}

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to