[jira] [Commented] (MAHOUT-1579) Implement a datamodel which can load data from hadoop filesystem directly

ASF GitHub Bot (JIRA) Sun, 15 Jun 2014 19:40:26 -0700

    [ 
https://issues.apache.org/jira/browse/MAHOUT-1579?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032085#comment-14032085
 ]


ASF GitHub Bot commented on MAHOUT-1579:
----------------------------------------

GitHub user HuangXiaomeng opened a pull request:

    https://github.com/apache/mahout/pull/19

    MAHOUT-1579 Implement a data model which can load data from hadoop files...

    Currently, to process data stored in HDFS, Mahout has to run a MapReduce
    job. I implemented a data model which can load data from a Hadoop
    filesystem directly and process it like FileDataModel.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/HuangXiaomeng/mahout HadoopFSDataModel

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/mahout/pull/19.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #19
    
----
commit d756917cf94aa87468cdb09fb07fa559ace06120
Author: Huang Xiaomeng <[email protected]>
Date:   2014-06-16T02:12:09Z

    MAHOUT-1579 Implement a data model which can load data from hadoop filesystem directly

----


> Implement a datamodel which can load data from hadoop filesystem directly
> -------------------------------------------------------------------------
>
>                 Key: MAHOUT-1579
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-1579
>             Project: Mahout
>          Issue Type: Improvement
>            Reporter: Xiaomeng Huang
>            Priority: Minor
>         Attachments: Mahout-1579.patch
>
>
> As we all know, FileDataModel can only load data from the local filesystem.
> But big data is usually stored in a Hadoop filesystem (e.g. HDFS).
> To process data in HDFS, we currently must run a MapReduce job.
> It is necessary to implement a data model which can load data from a Hadoop
> filesystem directly.
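
A minimal sketch of the idea described above, not the code from the pull request: if preference data is parsed from a generic InputStream rather than a local file, the same parsing logic can be fed from a Hadoop filesystem. The class name StreamPreferenceLoader and the nested-map result are hypothetical, and the "userID,itemID,value" line format is assumed here for illustration. With Hadoop on the classpath, the stream could presumably be obtained with FileSystem.get(conf).open(path); that part is only mentioned in a comment so the sketch stays free of external dependencies.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.nio.charset.StandardCharsets;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical helper for illustration; not part of Mahout or of the pull request. */
final class StreamPreferenceLoader {

    private StreamPreferenceLoader() {}

    /**
     * Reads "userID,itemID,value" lines (assumed format) from any InputStream
     * into a nested map: userID -> (itemID -> value).
     * Blank lines and lines starting with '#' are skipped.
     *
     * The caller decides where the stream comes from. With Hadoop available it
     * could be the stream returned by FileSystem.get(conf).open(new Path(...)),
     * which is an ordinary InputStream (not compiled or tested here).
     */
    static Map<Long, Map<Long, Float>> load(InputStream in) throws IOException {
        Map<Long, Map<Long, Float>> prefs = new HashMap<>();
        try (BufferedReader reader =
                 new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                line = line.trim();
                if (line.isEmpty() || line.startsWith("#")) {
                    continue; // skip blanks and comments
                }
                String[] fields = line.split(",");
                if (fields.length < 3) {
                    throw new IOException("Malformed preference line: " + line);
                }
                long userId = Long.parseLong(fields[0].trim());
                long itemId = Long.parseLong(fields[1].trim());
                float value = Float.parseFloat(fields[2].trim());
                prefs.computeIfAbsent(userId, k -> new HashMap<>()).put(itemId, value);
            }
        }
        return prefs;
    }
}
```

Because the loader depends only on InputStream, it can be exercised with a ByteArrayInputStream in tests and pointed at a local or Hadoop-backed stream in practice. Building an actual Mahout DataModel from the parsed map is left out of this sketch.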



--
This message was sent by Atlassian JIRA
(v6.2#6252)
