[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965849#action_12965849
 ] 

Allen Wittenauer commented on MAPREDUCE-2028:
---------------------------------------------

Actually, what should probably happen is that MultiFileWordCount's 
"MyInputFormat" and "MultiLineRecordRecord" should get promoted out of examples 
and officially into the mapred(uce) APIs. 

The following appears to implement exactly what us streaming users want/need:

$HADOOP_HOME/bin/hadoop  \
        jar \
        `ls $HADOOP_HOME/contrib/streaming/hadoop-*-streaming.jar` \
        -libjars `ls $HADOOP_HOME/hadoop-*-examples.jar` \
        -inputformat 
org.apache.hadoop.examples.MultiFileWordCount\$MyInputFormat \
        -inputreader 
org.apache.hadoop.examples.MultiFileWordCount\$MultiFileLineRecordReader \
        ....


> streaming should support MultiFileInputFormat
> ---------------------------------------------
>
>                 Key: MAPREDUCE-2028
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2028
>             Project: Hadoop Map/Reduce
>          Issue Type: Bug
>          Components: contrib/streaming
>    Affects Versions: 0.20.2
>            Reporter: Allen Wittenauer
>             Fix For: 0.21.1, 0.22.0
>
>
> There should be a way to call MultiFileInputFormat from streaming without 
> having to write Java code...

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to