[ 
https://issues.apache.org/jira/browse/MAHOUT-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13012029#comment-13012029
 ] 

Julien Nioche commented on MAHOUT-621:
--------------------------------------

Re-Behemoth : I've started working on a Mahout module 
(https://github.com/jnioche/behemoth/tree/master/modules/mahout) which will 
help converting the Behemoth sequence files into vectors as done by seq2sparse.

Am searching for a way to get round 
https://issues.apache.org/jira/browse/MAHOUT-368 but I think this is the last 
hurdle in the way before the module is fully functional.

> Support more data import mechanisms
> -----------------------------------
>
>                 Key: MAHOUT-621
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-621
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Grant Ingersoll
>              Labels: gsoc2011, mahout-gsoc-11
>
> We should have more ways of getting data in:
> 1. ARFF (MAHOUT-155)
> 2. CSV (MAHOUT-548)
> 3. Databases
> 4. Behemoth (Tika, Map-Reduce)
> 5. Other

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to