kyungwan nam created YARN-6153:
----------------------------------
Summary: keepContainer does not work when AM retry window is set
Key: YARN-6153
URL: https://issues.apache.org/jira/browse/YARN-6153
Project: Hadoop YARN
Issue Type: Bug
Components: resourcemanager
Affects Versions: 2.7.1
Reporter: kyungwan nam
yarn.resourcemanager.am.max-attempts is set to 2 in my cluster.
I submitted a YARN application (a Slider app) with keepContainers=true and
attemptFailuresValidityInterval=300000.
It worked properly the first time the AM failed:
all containers launched by the previous AM were resynced with the new AM
(attempt 2) without being killed.
After 10 minutes, I expected the AM failure count to have been reset by
attemptFailuresValidityInterval (5 minutes).
However, all containers were killed when the AM failed a second time. (A new
AM, attempt 3, was launched properly.)
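
To make the expectation above concrete, here is a minimal sketch of the failure-window arithmetic as I understand it. It is not ResourceManager code: the class and method names are hypothetical, and treating a non-positive interval as "count every failure" is an assumption of this sketch.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical model of the expected behaviour; not taken from the YARN source.
public class AttemptFailureWindow {

    // Counts the failures that are still inside the validity interval at nowMs.
    // Assumption: a non-positive interval disables the window, so all failures count.
    static int countFailuresInWindow(List<Long> failureTimesMs, long nowMs,
                                     long validityIntervalMs) {
        int count = 0;
        for (long t : failureTimesMs) {
            if (validityIntervalMs <= 0 || nowMs - t < validityIntervalMs) {
                count++;
            }
        }
        return count;
    }

    // Retries are exhausted once the counted failures reach the configured maximum.
    static boolean retriesExhausted(int countedFailures, int maxAttempts) {
        return countedFailures >= maxAttempts;
    }

    public static void main(String[] args) {
        long intervalMs = 300_000L;      // attemptFailuresValidityInterval (5 minutes)
        int maxAttempts = 2;             // yarn.resourcemanager.am.max-attempts
        long firstFailureMs = 0L;        // first AM failure
        long secondFailureMs = 600_000L; // second AM failure, 10 minutes later

        List<Long> failures = Arrays.asList(firstFailureMs, secondFailureMs);
        int counted = countFailuresInWindow(failures, secondFailureMs, intervalMs);

        System.out.println("failures counted at second failure: " + counted);
        System.out.println("retries exhausted: " + retriesExhausted(counted, maxAttempts));
    }
}
```

With the values from this report, only the second failure falls inside the 5-minute window, so the count stays below max-attempts. That is why I expected the containers to be kept across the second AM failure as well.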