[ https://issues.apache.org/jira/browse/MAPREDUCE-1849?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12931121#action_12931121 ]
Ted Dunning commented on MAPREDUCE-1849: ---------------------------------------- Pig is not a suitable framework for this because it imposes very high overhead due to a very wide and complex API and no abstract syntax layer. Contrarily, Plume is moving along very nicely. We have a preliminary optimizer that actually does some important optimizations that Pig doesn't do. Check it out: {quote} See http://github.com/tdunning/Plume for the source, http://tdunning.blogspot.com/2010/07/new-grool.html for some discussion. {quote} > Implement a FlumeJava-like library for operations over parallel collections > using Hadoop MapReduce > -------------------------------------------------------------------------------------------------- > > Key: MAPREDUCE-1849 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-1849 > Project: Hadoop Map/Reduce > Issue Type: New Feature > Reporter: Jeff Hammerbacher > > The API used internally at Google is described in great detail at > http://portal.acm.org/citation.cfm?id=1806596.1806638. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.