Daniel Templeton created MAPREDUCE-6864:
-------------------------------------------
Summary: Hadoop streaming creates 2 mappers when the input has
only one block
Key: MAPREDUCE-6864
URL: https://issues.apache.org/jira/browse/MAPREDUCE-6864
Project: Hadoop Map/Reduce
Issue Type: Bug
Components: mrv2
Affects Versions: 2.7.3
Reporter: Daniel Templeton
If a streaming job is run against input that is less than 2 blocks, 2 mappers
will be created, both operating on the same split, both producing (duplicate)
output. In some cases the second mapper will consistently fail. I've not seen
the failure with input less than 10 bytes or more than a couple MB. I have
seen it with a 4kB input.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]