Nathan Roberts created TEZ-3440:

             Summary: Shuffling to memory can get out-of-sync when fetching 
multiple compressed map outputs
                 Key: TEZ-3440
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Nathan Roberts

Haven't verified yet but certainly looks like tez needs same fix as 
MAPREDUCE-5308 in IFile.

Specifically saw this because downstream tasks were reporting enough fetch 
failures that long-running upstream tasks had to be re-run, which makes job run 
for much longer than it needs.

Usually shows itself as an "Invalid map id" error on a multi-map fetch on part 
2-n (i.e. never the first one).

This message was sent by Atlassian JIRA

Reply via email to