[ 
https://issues.apache.org/jira/browse/HADOOP-12547?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14992218#comment-14992218
 ] 

Allen Wittenauer commented on HADOOP-12547:
-------------------------------------------

bq. Because the Java MR API only supports Java (and possibly other JVM 
languages), whereas streaming supports Perl, Python, Ruby, C, C++, and any 
other non-JVM programming language you can think of.

I'm pretty sure streaming supports JVM languages too since I'd be surprised if 
Java couldn't read and write from stdin and stdout... which, by your own 
argument, means we should drop the Java client APIs too.  After all, that would 
reduce the code footprint, limit the testing needs, etc, etc, too right?

bq. What specifically are you -1 on? Removal, deprecation, or both?

At this point, both.  Perhaps deprecation in trunk if the native task stuff 
actually works.

bq. Can you explain when you would advise one of your customers to use pipes 
instead of streaming?

https://www.quora.com/Why-would-anyone-use-Hadoop-Pipes

https://www.quora.com/If-my-current-job-involves-purely-C-C++-coding-what-are-the-best-ways-to-learn-hadoop-and-contribute-to-the-apache-hadoop-project-I-understand-most-of-hadoop-code-is-Java-Are-there-any-C-C++-bindings-for-hadoop-used-in-production-clusters

bq. If you feel that pipes is worth maintaining, can you file JIRAs to 
reinstate the documentation, fix the compiler warnings, and fix the security 
bugs?

Sure, I'll file JIRAs for these.

> Deprecate hadoop-pipes
> ----------------------
>
>                 Key: HADOOP-12547
>                 URL: https://issues.apache.org/jira/browse/HADOOP-12547
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Colin Patrick McCabe
>            Assignee: Colin Patrick McCabe
>            Priority: Minor
>
> Development appears to have stopped on hadoop-pipes upstream for the last few 
> years, aside from very basic maintenance.  Hadoop streaming seems to be a 
> better alternative, since it supports more programming languages and is 
> better implemented.
> There were no responses to a message on the mailing list asking for users of 
> Hadoop pipes... and in my experience, I have never seen anyone use this.  We 
> should remove it to reduce our maintenance burden and build times.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to