Jonathan Eagles created TEZ-3914:

             Summary: Recovering a large DAG can hang the job
                 Key: TEZ-3914
             Project: Apache Tez
          Issue Type: Bug
            Reporter: Jonathan Eagles
            Assignee: Jonathan Eagles

Any failure to parse a recovery event is ignored and treated as EOF. The job
can then hang: task completions recorded after the unparsed event may be
missed, so the shuffle waits indefinitely for output it believes is pending.
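
The failure mode described above can be illustrated with a minimal sketch.
This is not Tez's recovery code: the class RecoveryReadSketch, the event
layout (an int type followed by a UTF payload), and the TASK_FINISHED code
are all hypothetical, chosen only to show how treating every read failure as
EOF silently drops later events, and how separating EOFException from other
IOExceptions surfaces the problem instead.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.EOFException;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a recovery log reader; not Tez's actual classes.
class RecoveryReadSketch {
    static final int TASK_FINISHED = 1; // hypothetical event type code

    // Encodes events as (int type, UTF payload) pairs.
    static byte[] writeLog(int[] types, String[] payloads) throws IOException {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            for (int i = 0; i < types.length; i++) {
                out.writeInt(types[i]);
                out.writeUTF(payloads[i]);
            }
        }
        return bytes.toByteArray();
    }

    // Reads one event. EOFException signals the end of the log;
    // a plain IOException signals an event that could not be parsed.
    static String readEvent(DataInputStream in) throws IOException {
        int type = in.readInt();
        String payload = in.readUTF();
        if (type != TASK_FINISHED) {
            throw new IOException("unparseable recovery event, type=" + type);
        }
        return payload;
    }

    // Pattern described in the report: any failure ends the read silently,
    // so events after the bad one are never seen.
    static List<String> readLenient(byte[] log) {
        List<String> events = new ArrayList<>();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(log))) {
            while (true) {
                events.add(readEvent(in));
            }
        } catch (IOException e) {
            // Parse errors and real EOF are indistinguishable here.
        }
        return events;
    }

    // Alternative: only EOFException ends the read; parse errors propagate.
    static List<String> readStrict(byte[] log) throws IOException {
        List<String> events = new ArrayList<>();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(log))) {
            while (true) {
                try {
                    events.add(readEvent(in));
                } catch (EOFException eof) {
                    return events;
                }
            }
        }
    }
}
```

Given a log of three events whose second entry has an unknown type,
readLenient returns only the first event and reports nothing, while
readStrict throws, so the caller can fail the recovery rather than wait on
completions it never saw.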
