[
https://issues.apache.org/jira/browse/MAPREDUCE-1122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12885482#action_12885482
]
Amareshwari Sriramadasu commented on MAPREDUCE-1122:
----------------------------------------------------
For supporting new api in streaming, the implementation involves two major
tasks:
# Setting job configuration for the streaming job: set appropriate mapper and
reducer depending on the arguments passed. Summarizing the above requirements
table :
** The old api mapper, PipeMapper, is used as mapper for the job only if
mapper is command and
a) old api input format is passed or
b) #reduces=0 and old api output format is passed or
c) #reduces !=0 and old api partitioner is passed.
** Similarly the old api reducer, PipeReducer, is used as reducer for the job
only if reducer is command and old output format is passed.
# Implementation of new api streaming mapper, reducer and etc.
> streaming with custom input format does not support the new API
> ---------------------------------------------------------------
>
> Key: MAPREDUCE-1122
> URL: https://issues.apache.org/jira/browse/MAPREDUCE-1122
> Project: Hadoop Map/Reduce
> Issue Type: Bug
> Components: contrib/streaming
> Affects Versions: 0.20.1
> Environment: any OS
> Reporter: Keith Jackson
> Assignee: Amareshwari Sriramadasu
>
> When trying to implement a custom input format for use with streaming, I have
> found that streaming does not support the new API,
> org.apache.hadoop.mapreduce.InputFormat, but requires the old API,
> org.apache.hadoop.mapred.InputFormat.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.