Prabhu Joseph created YARN-11455:
------------------------------------
Summary: All RMs in HA are stuck in standby when the ZK connection is disconnected
Key: YARN-11455
URL: https://issues.apache.org/jira/browse/YARN-11455
Project: Hadoop YARN
Issue Type: Bug
Components: resourcemanager
Affects Versions: 3.3.3, 2.10.1
Reporter: Prabhu Joseph
Assignee: Prabhu Joseph
All RMs in HA are stuck in standby when the ZK connection held by the active RM
is disconnected.
{code:java}
2023-02-22 13:08:19,832 INFO org.apache.hadoop.ha.ActiveStandbyElector (main-EventThread): Session disconnected. Entering neutral mode...
2023-02-22 13:08:19,832 WARN org.apache.hadoop.yarn.server.resourcemanager.ActiveStandbyElectorBasedElectorService (main-EventThread): Lost contact with Zookeeper. Transitioning to standby in 10000 ms if connection is not reestablished.
{code}
*Repro:*
Send a Disconnected event to the active RM using the code below.
{code:java}
zkConnectionState = ConnectionState.DISCONNECTED;
enterNeutralMode();
{code}
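The timeout behaviour described by the WARN line above can be modelled in isolation. The following is only a minimal sketch under stated assumptions, not Hadoop code: the class {{DisconnectTimeoutSketch}}, its methods, and the explicit clock parameter are all hypothetical, and it models only "transition to standby if the connection is not reestablished within the timeout". It does not reproduce or explain why no RM becomes active again afterwards.

```java
// Hypothetical, self-contained model of the "standby after timeout" rule.
// None of these names exist in Hadoop; time is passed in explicitly so the
// behaviour is deterministic.
public class DisconnectTimeoutSketch {
    enum State { ACTIVE, STANDBY }

    private final long timeoutMs;
    private State state = State.ACTIVE;
    private long disconnectedAtMs = -1; // -1 means "currently connected"

    DisconnectTimeoutSketch(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    // Record the time of the first Disconnected event; repeats are ignored.
    void onDisconnected(long nowMs) {
        if (disconnectedAtMs < 0) {
            disconnectedAtMs = nowMs;
        }
    }

    // A reconnect before the deadline cancels the pending transition.
    void onReconnected() {
        disconnectedAtMs = -1;
    }

    // Called periodically; moves to STANDBY once the timeout has elapsed.
    void tick(long nowMs) {
        if (disconnectedAtMs >= 0 && nowMs - disconnectedAtMs >= timeoutMs) {
            state = State.STANDBY;
            disconnectedAtMs = -1;
        }
    }

    State state() {
        return state;
    }

    public static void main(String[] args) {
        // 10000 ms matches the timeout reported in the log above.
        DisconnectTimeoutSketch rm = new DisconnectTimeoutSketch(10000);
        rm.onDisconnected(0);
        rm.tick(9999);
        System.out.println(rm.state()); // still ACTIVE just before the deadline
        rm.tick(10000);
        System.out.println(rm.state()); // STANDBY once the timeout elapses
    }
}
```

In this model, calling {{onReconnected()}} before the deadline keeps the instance ACTIVE. The reported bug concerns what happens after the transition to standby, which this sketch deliberately leaves out.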