Jason Lowe created TEZ-3335:
-------------------------------

             Summary: DAG client thinks app is still running when app status is null
                 Key: TEZ-3335
                 URL: https://issues.apache.org/jira/browse/TEZ-3335
             Project: Apache Tez
          Issue Type: Bug
    Affects Versions: 0.7.1
            Reporter: Jason Lowe


When an RM restarts without recovering apps (i.e., either work-preserving 
restart is not enabled or the state store was removed) and the YARN application 
history is enabled, YarnClient can return an application report with the app 
status as null.  The RM doesn't know about the application, so the client 
redirects to the Application History Server (AHS).  The AHS knows the app 
started at some point but will never receive a finished event, hence the null 
app status.

The DAG client fails to detect this scenario and believes the app is still 
running, so, for example, Hive clients will keep polling for the status of an 
app that no longer exists.
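
One possible way to handle this, shown only as a minimal sketch and not as the 
actual Tez fix: treat a null app status as its own case instead of letting it 
fall through to "running".  The AppState and ClientView enums and the classify 
method below are hypothetical stand-ins invented for illustration; they are not 
the real YARN or Tez types.

```java
public class DagStatusSketch {
    // Hypothetical stand-in for the application states a report can carry.
    enum AppState { NEW, SUBMITTED, ACCEPTED, RUNNING, FINISHED, FAILED, KILLED }

    // Hypothetical view the client could derive from a report.
    enum ClientView { RUNNING, TERMINATED, UNKNOWN_TO_CLUSTER }

    // Map the reported state to a client view. A null state is handled
    // explicitly so that it is never mistaken for a running app.
    static ClientView classify(AppState reported) {
        if (reported == null) {
            // Assumption: the report came from the history server, which
            // never saw a finished event for this app.
            return ClientView.UNKNOWN_TO_CLUSTER;
        }
        switch (reported) {
            case FINISHED:
            case FAILED:
            case KILLED:
                return ClientView.TERMINATED;
            default:
                return ClientView.RUNNING;
        }
    }

    public static void main(String[] args) {
        System.out.println(classify(null));
        System.out.println(classify(AppState.RUNNING));
        System.out.println(classify(AppState.KILLED));
    }
}
```

With a check like this, a caller could stop polling as soon as it sees 
UNKNOWN_TO_CLUSTER rather than retrying indefinitely.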



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
