[jira] Commented: (MAPREDUCE-1849) Implement a FlumeJava-like library for operations over parallel collections using Hadoop MapReduce

Jeff Hammerbacher (JIRA) Thu, 10 Jun 2010 08:31:43 -0700

    [ 
https://issues.apache.org/jira/browse/MAPREDUCE-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12877451#action_12877451
 ]


Jeff Hammerbacher commented on MAPREDUCE-1849:
----------------------------------------------

Owen: sure. They provide "derived operators" as well, like count(), join(), and 
top(). The main difference from Pig seems to be allowing users to work in Java. 
In fact, the Google team initially implemented their approach in a new language 
called Lumberjack, but mentions that, among other things, the implementation of 
a new language was a lot of work, and most importantly, novelty is an obstacle 
to adoption. They settled on Java and seem to have had some internal success.

> Implement a FlumeJava-like library for operations over parallel collections 
> using Hadoop MapReduce
> --------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1849
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1849
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>            Reporter: Jeff Hammerbacher
>
> The API used internally at Google is described in great detail at 
> http://portal.acm.org/citation.cfm?id=1806596.1806638.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Commented: (MAPREDUCE-1849) Implement a FlumeJava-like library for operations over parallel collections using Hadoop MapReduce

Reply via email to