In Streaming, allow different mappers for different subsets of the input
------------------------------------------------------------------------
Key: HADOOP-2277
URL: https://issues.apache.org/jira/browse/HADOOP-2277
Project: Hadoop
Issue Type: Improvement
Components: contrib/streaming
Reporter: arkady borkovsky
The command line may look like this:
-mapper mapper-command-1 -input dir11 -mapper mapper-command-2 -input dir22
input -dir21
meaning that map phase will apply mapper-command-1 to part files from dir11,
and the part files from dir22 and dir21 will be processed by mapper-command-2
then all will be shuffled and processed by a single reducer.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.