[ https://issues.apache.org/jira/browse/YARN-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13967446#comment-13967446 ]
Rohith commented on YARN-1934:
------------------------------
Let's say the zkClient session is connected to server X. Killing or stopping
server X causes a Disconnected event. zkClient then immediately retries
internally against the other configured servers. If it establishes a
connection with another server, a SyncConnected event is triggered; otherwise,
if it can connect to one of the configured servers but quorum is not
satisfied, a session Expired event is triggered.
In my test, 2 of the 3 ZooKeeper servers were killed so that quorum was not
satisfied, but these ZooKeeper servers were restarted after 5 seconds.
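
To make the sequence above concrete, here is a minimal toy model in Java. It
only encodes the behaviour as I described it and does not use the real
ZooKeeper client; the class, enum and method names (SessionEventSketch, Event,
eventsAfterServerLoss) are made up for illustration, and the case where no
server is reachable at all is an assumption of the sketch.

```java
import java.util.ArrayList;
import java.util.List;

/** Toy model of the session events described above; not the real ZooKeeper client. */
final class SessionEventSketch {

    /** Hypothetical stand-ins for the watcher states named in the comment. */
    enum Event { DISCONNECTED, SYNC_CONNECTED, EXPIRED }

    /**
     * Events the client sees after the server it was connected to goes away,
     * following the description in this comment.
     *
     * @param reachableServers number of other configured servers that accept a connection
     * @param quorumSatisfied  whether the ensemble still has quorum
     */
    static List<Event> eventsAfterServerLoss(int reachableServers, boolean quorumSatisfied) {
        List<Event> events = new ArrayList<>();
        // Losing the connected server always produces Disconnected first.
        events.add(Event.DISCONNECTED);
        if (reachableServers > 0 && quorumSatisfied) {
            // Reconnected to another server that is part of a working quorum.
            events.add(Event.SYNC_CONNECTED);
        } else if (reachableServers > 0) {
            // A server answers, but quorum is not satisfied.
            events.add(Event.EXPIRED);
        }
        // Assumption: with no reachable server the client just keeps retrying,
        // so this model emits no further event.
        return events;
    }

    public static void main(String[] args) {
        System.out.println(eventsAfterServerLoss(2, true));
        System.out.println(eventsAfterServerLoss(1, false));
    }
}
```

The second call corresponds to my test setup, where 2 of the 3 servers are
down and the remaining one cannot form a quorum.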
> Potential NPE in ZKRMStateStore caused by handling Disconnected event from ZK.
> ------------------------------------------------------------------------------
>
> Key: YARN-1934
> URL: https://issues.apache.org/jira/browse/YARN-1934
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager
> Affects Versions: 2.4.0
> Reporter: Rohith
> Priority: Critical
> Attachments: RM.txt
>
>
> For the ZK Disconnected event, zkClient is set to null. This makes it very
> prone to throwing an NPE.
> {noformat}
>       case Disconnected:
>         LOG.info("ZKRMStateStore Session disconnected");
>         oldZkClient = zkClient;
>         zkClient = null;
>         break;
> {noformat}
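
One possible way to avoid the NPE is sketched below. This is not the
ZKRMStateStore code and not a proposed patch: FakeClient, NullSafeClientSketch
and all method names are hypothetical, and restoring the old handle on
SyncConnected is an assumption of the sketch. The idea is that callers read
the client reference once and fail with an explicit exception when it is
null, instead of dereferencing a field that the Disconnected branch may have
cleared.

```java
import java.util.concurrent.atomic.AtomicReference;

/** Sketch of null-safe access to a client handle that a watcher may clear. */
final class NullSafeClientSketch {

    /** Hypothetical stand-in for the ZooKeeper handle. */
    static final class FakeClient {
        String read(String path) {
            return "data@" + path;
        }
    }

    private final AtomicReference<FakeClient> client =
            new AtomicReference<>(new FakeClient());
    private FakeClient oldClient;

    /** Mirrors the quoted Disconnected branch: remember the handle, then clear it. */
    synchronized void onDisconnected() {
        oldClient = client.getAndSet(null);
    }

    /** Assumption: a later SyncConnected event restores the remembered handle. */
    synchronized void onSyncConnected() {
        if (client.get() == null && oldClient != null) {
            client.set(oldClient);
            oldClient = null;
        }
    }

    /** Reads through the client, failing explicitly while disconnected. */
    String read(String path) {
        // Take one snapshot so a concurrent Disconnected event cannot null the
        // reference between the check and the call.
        FakeClient snapshot = client.get();
        if (snapshot == null) {
            throw new IllegalStateException("ZK client is disconnected, retry later");
        }
        return snapshot.read(path);
    }

    public static void main(String[] args) {
        NullSafeClientSketch store = new NullSafeClientSketch();
        System.out.println(store.read("/rmstore"));
        store.onDisconnected();
        try {
            store.read("/rmstore");
        } catch (IllegalStateException e) {
            System.out.println("caught: " + e.getMessage());
        }
    }
}
```

A caller can then catch the explicit exception and retry or wait for
reconnection, rather than failing with an NPE from an unrelated code path.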
--
This message was sent by Atlassian JIRA
(v6.2#6252)