Bikas Saha created TEZ-1142:
-------------------------------

             Summary: Bail out early if a vertex has too many failures
                 Key: TEZ-1142
                 URL: https://issues.apache.org/jira/browse/TEZ-1142
             Project: Apache Tez
          Issue Type: Sub-task
            Reporter: Bikas Saha


If there are a high number of failures in a vertex then bail out early instead 
of waiting for 4 failures of the same task.
Lets say the vertex sees N consecutive failures without any successful task 
completion. Thats probably good enough evidence to infer that there is some bug 
in the code for the tasks in that vertex. Bailing out early wastes less 
resources.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to