Tao Yang created YARN-7003:
------------------------------
Summary: DRAINING state of queues can't be recovered after RM restart
Key: YARN-7003
URL: https://issues.apache.org/jira/browse/YARN-7003
Project: Hadoop YARN
Issue Type: Bug
Components: capacityscheduler
Affects Versions: 3.0.0-alpha3
Reporter: Tao Yang
DRAINING is a temporary state that exists only in RM memory. When a queue's
state is set to STOPPED but the queue still has pending or active apps,
refreshing queues changes its state to DRAINING instead of STOPPED. We've
encountered the problem that after an RM restart the state of such a queue is
always STOPPED, so the queue can be removed at any time, leaving some apps
in a non-existent queue.
To fix this problem, we could recover the DRAINING state during the recovery
of pending/active apps. I will upload a patch with a test case later for review.
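
To illustrate the idea, here is a minimal, self-contained sketch of the behaviour
described above. It is not the actual patch and does not use the real
CapacityScheduler classes: SketchQueue, refresh, recoverApp and finishApp are
hypothetical names, and the enum is only a simplified stand-in for the queue
states mentioned in this issue. It assumes that app recovery happens after the
queue has been re-created from configuration as STOPPED.

```java
import java.util.ArrayList;
import java.util.List;

public class DrainingRecoverySketch {
    // Simplified stand-in for the queue states discussed in this issue.
    enum QueueState { RUNNING, DRAINING, STOPPED }

    // Hypothetical minimal queue model (assumption, not a YARN class).
    static class SketchQueue {
        final String name;
        QueueState state;
        final List<String> apps = new ArrayList<>();

        SketchQueue(String name, QueueState configuredState) {
            this.name = name;
            this.state = configuredState;
        }

        // Queue refresh: a queue configured as STOPPED that still holds
        // pending or active apps goes to DRAINING instead of STOPPED.
        void refresh(QueueState configuredState) {
            if (configuredState == QueueState.STOPPED && !apps.isEmpty()) {
                state = QueueState.DRAINING;
            } else {
                state = configuredState;
            }
        }

        // Proposed idea: while recovering an app after restart, move a
        // STOPPED queue back to DRAINING so it is not treated as empty.
        void recoverApp(String appId) {
            apps.add(appId);
            if (state == QueueState.STOPPED) {
                state = QueueState.DRAINING;
            }
        }

        // When the last app finishes, a DRAINING queue becomes STOPPED.
        void finishApp(String appId) {
            apps.remove(appId);
            if (state == QueueState.DRAINING && apps.isEmpty()) {
                state = QueueState.STOPPED;
            }
        }
    }

    public static void main(String[] args) {
        // After restart the queue is rebuilt from configuration as STOPPED.
        SketchQueue queue = new SketchQueue("a", QueueState.STOPPED);
        System.out.println("after restart:  " + queue.state);

        // Recovering a pending/active app restores the DRAINING state.
        queue.recoverApp("app_1");
        System.out.println("after recovery: " + queue.state);

        // Once the app completes, the queue can safely become STOPPED.
        queue.finishApp("app_1");
        System.out.println("after finish:   " + queue.state);
    }
}
```

In this sketch a queue in DRAINING state still has apps in it, so any check that
only allows removing STOPPED queues would no longer apply to it after restart.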