Jason Lowe created TEZ-3335:
-------------------------------
Summary: DAG client thinks app is still running when app status is
null
Key: TEZ-3335
URL: https://issues.apache.org/jira/browse/TEZ-3335
Project: Apache Tez
Issue Type: Bug
Affects Versions: 0.7.1
Reporter: Jason Lowe
When an RM restarts without recovering apps (i.e.: either work-preserving is
not enabled or state store was removed) and the YARN application history is
enabled then YarnClient can return an application report with the app status as
null. The RM doesn't know about the application, so the client redirects to
the AHS. The AHS knows the app started at some point but will never received a
finished event, hence the null app status.
The DAG client fails to detect this scenario and believes the app is still
running, so for example Hive clients will continue to hammer for status on an
app that doesn't exist.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)