[
https://issues.apache.org/jira/browse/MAHOUT-286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sean Owen updated MAHOUT-286:
-----------------------------
Issue Type: Improvement (was: Bug)
Fix Version/s: 0.5
Affects Version/s: 0.3
Priority: Minor (was: Major)
I'll mark this for 0.5 but not seeing any movement on this, so may just get
closed out
> Need to be able to run classifiers from non-text input (such as ARFF data)
> --------------------------------------------------------------------------
>
> Key: MAHOUT-286
> URL: https://issues.apache.org/jira/browse/MAHOUT-286
> Project: Mahout
> Issue Type: Improvement
> Affects Versions: 0.3
> Reporter: Ted Dunning
> Priority: Minor
> Fix For: 0.5
>
> Attachments: data.arff, data.training.arff, mahout.log, run.sh,
> weka.log
>
>
> Martin Haeger wrote this:
> {quote}
> We're experimenting a bit with Weka and Mahout. Our input data is a
> relation in ARFF format (see attached data.training.arff), and we'd
> like to classify it using Mahout. However, it seems (to us, at first)
> that the Mahout classifier.bayes.interfaces.Algorithm interface is
> centered around documents of text, and not general attribute data.
> Thus, running the classifier causes our ARFF data to be interpreted as
> a document of words, with not very useful results (see attached
> mahout.log).
> With Weka, we're able to get the results we want (see attached weka.log).
> Any suggestions for how to get this working?
> {quote}
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.