Nathan Roberts created TEZ-3440:
-----------------------------------

             Summary: Shuffling to memory can get out-of-sync when fetching 
multiple compressed map outputs
                 Key: TEZ-3440
                 URL: https://issues.apache.org/jira/browse/TEZ-3440
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Nathan Roberts


Haven't verified yet but certainly looks like tez needs same fix as 
MAPREDUCE-5308 in IFile.

Specifically saw this because downstream tasks were reporting enough fetch 
failures that long-running upstream tasks had to be re-run, which makes job run 
for much longer than it needs.

Usually shows itself as an "Invalid map id" error on a multi-map fetch on part 
2-n (i.e. never the first one).
 




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to