[ https://issues.apache.org/jira/browse/MAHOUT-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13012029#comment-13012029 ]
Julien Nioche commented on MAHOUT-621:
--------------------------------------
Re: Behemoth: I've started working on a Mahout module
(https://github.com/jnioche/behemoth/tree/master/modules/mahout) which will
help convert the Behemoth sequence files into vectors, as seq2sparse does.
I am looking for a way to get around
https://issues.apache.org/jira/browse/MAHOUT-368, but I think this is the last
hurdle before the module is fully functional.
> Support more data import mechanisms
> -----------------------------------
>
> Key: MAHOUT-621
> URL: https://issues.apache.org/jira/browse/MAHOUT-621
> Project: Mahout
> Issue Type: Improvement
> Reporter: Grant Ingersoll
> Labels: gsoc2011, mahout-gsoc-11
>
> We should have more ways of getting data in:
> 1. ARFF (MAHOUT-155)
> 2. CSV (MAHOUT-548)
> 3. Databases
> 4. Behemoth (Tika, Map-Reduce)
> 5. Other