Prabhu Joseph created YARN-11455:
------------------------------------

             Summary: All RMs in HA are stuck in standby when the ZK connection is disconnected
                 Key: YARN-11455
                 URL: https://issues.apache.org/jira/browse/YARN-11455
             Project: Hadoop YARN
          Issue Type: Bug
          Components: resourcemanager
    Affects Versions: 3.3.3, 2.10.1
            Reporter: Prabhu Joseph
            Assignee: Prabhu Joseph


All RMs in HA are stuck in standby when the ZK connection held by the active RM 
is disconnected.
{code:java}
2023-02-22 13:08:19,832 INFO org.apache.hadoop.ha.ActiveStandbyElector (main-EventThread): Session disconnected. Entering neutral mode...
2023-02-22 13:08:19,832 WARN org.apache.hadoop.yarn.server.resourcemanager.ActiveStandbyElectorBasedElectorService (main-EventThread): Lost contact with Zookeeper. Transitioning to standby in 10000 ms if connection is not reestablished.
{code}
 

*Repro:*

Send a Disconnected event to the active RM using the code below.
{code:java}
zkConnectionState = ConnectionState.DISCONNECTED;
enterNeutralMode();
{code}
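
For readers unfamiliar with the grace period mentioned in the WARN line above, the following is a minimal, standalone sketch of that behaviour as the log describes it: after a disconnect, the RM stays active for a fixed window (10000 ms in the log) and moves to standby only if no reconnect arrives in time. It is not the Hadoop implementation and says nothing about the root cause of this bug; the class {{GracePeriodSketch}} and all of its methods are hypothetical names made up for illustration, and time is passed in explicitly so the example is deterministic.

```java
import java.util.OptionalLong;

/**
 * Hypothetical model of the "transition to standby in N ms if the connection
 * is not reestablished" behaviour seen in the log. Not Hadoop code.
 */
class GracePeriodSketch {
    enum State { ACTIVE, STANDBY }

    private final long graceMs;
    private State state = State.ACTIVE;
    // Time at which the RM gives up waiting for a reconnect, if a wait is pending.
    private OptionalLong deadlineMs = OptionalLong.empty();

    GracePeriodSketch(long graceMs) {
        this.graceMs = graceMs;
    }

    /** Disconnected event: start the grace period unless one is already running. */
    void onDisconnected(long nowMs) {
        if (state == State.ACTIVE && !deadlineMs.isPresent()) {
            deadlineMs = OptionalLong.of(nowMs + graceMs);
        }
    }

    /** Reconnected event: cancel any pending transition to standby. */
    void onReconnected() {
        deadlineMs = OptionalLong.empty();
    }

    /** Advance the clock: go to standby once the grace period has expired. */
    void tick(long nowMs) {
        if (deadlineMs.isPresent() && nowMs >= deadlineMs.getAsLong()) {
            state = State.STANDBY;
            deadlineMs = OptionalLong.empty();
        }
    }

    State state() {
        return state;
    }

    public static void main(String[] args) {
        GracePeriodSketch rm = new GracePeriodSketch(10_000);
        rm.onDisconnected(0);
        rm.tick(9_999);
        System.out.println(rm.state()); // ACTIVE: still inside the grace period
        rm.tick(10_000);
        System.out.println(rm.state()); // STANDBY: no reconnect arrived in time
    }
}
```

In this model a reconnect before the deadline keeps the RM active. What this issue reports is that, in the real system, every RM ends up in standby after the disconnect.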
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)