[ 
https://issues.apache.org/jira/browse/HADOOP-12547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14992364#comment-14992364
 ] 

Allen Wittenauer commented on HADOOP-12547:
-------------------------------------------

bq. The Java client APIs provide significant advantages that neither streaming 
nor pipes provide. 

This is a false statement. Partitioning, for example, can't be done natively in 
streaming code but can in pipes.  In streaming, you can only provide a Java 
class.

bq. In fact, they don't even mention streaming at all. Again I ask, why would 
you personally ever use or recommend pipes when streaming is available?

Correct. Because if the MR code is being written in C++, why would one use the 
less functional streaming API?  If one believes that MR jobs consist of nothing 
but reading and writing KVs I could see that, but there's a lot more going on 
under the hood in more advanced jobs.  That functionality is just flat-out not 
available in streaming.

BTW, thanks to this discourse, I realize that yes, not deprecating until trunk 
is completely the correct thing to do.  It clearly fills a gap not fulfilled by 
any other APIs.  So, I'm more convinced than ever that a -1 is appropriate here.

> Deprecate hadoop-pipes
> ----------------------
>
>                 Key: HADOOP-12547
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12547
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Colin Patrick McCabe
>            Assignee: Colin Patrick McCabe
>            Priority: Minor
>
> Development appears to have stopped on hadoop-pipes upstream for the last few 
> years, aside from very basic maintenance.  Hadoop streaming seems to be a 
> better alternative, since it supports more programming languages and is 
> better implemented.
> There were no responses to a message on the mailing list asking for users of 
> Hadoop pipes... and in my experience, I have never seen anyone use this.  We 
> should remove it to reduce our maintenance burden and build times.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
