Daniel Templeton created MAPREDUCE-6864:
-------------------------------------------

             Summary: Hadoop streaming creates 2 mappers when the input has 
only one block
                 Key: MAPREDUCE-6864
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-6864
             Project: Hadoop Map/Reduce
          Issue Type: Bug
          Components: mrv2
    Affects Versions: 2.7.3
            Reporter: Daniel Templeton


If a streaming job is run against input that is less than 2 blocks, 2 mappers 
will be created, both operating on the same split, both producing (duplicate) 
output.  In some cases the second mapper will consistently fail.  I've not seen 
the failure with input less than 10 bytes or more than a couple MB.  I have 
seen it with a 4kB input.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

---------------------------------------------------------------------
To unsubscribe, e-mail: mapreduce-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: mapreduce-dev-h...@hadoop.apache.org

Reply via email to