[ 
https://issues.apache.org/jira/browse/YARN-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13967446#comment-13967446
 ] 

Rohith commented on YARN-1934:
------------------------------

Let's say the zkClient session is connected to server X. Killing or stopping 
server X causes a Disconnected event. The zkClient immediately retries 
internally against the other configured servers. If it is able to establish a 
connection with another server, a SyncConnected event is triggered. Otherwise, 
if it is able to connect to one of the configured servers but quorum is not 
satisfied, a session Expired event is triggered.

In my test, 2 out of 3 ZooKeeper servers were killed so that quorum was not 
satisfied. These ZooKeeper servers were then restarted after 5 seconds.
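
The event sequence above can be sketched roughly as follows. This is only a 
minimal, self-contained illustration and not the actual ZKRMStateStore code: 
the KeeperState enum is a local stand-in for ZooKeeper's 
Watcher.Event.KeeperState, and FakeClient, currentSessionId() and the handling 
of SyncConnected and Expired are assumptions made for the example. It shows 
that any access to zkClient between the Disconnected event and the following 
SyncConnected or Expired event hits a null reference.

```java
import java.util.ArrayList;
import java.util.List;

class ZkEventSketch {
    // Local stand-in for ZooKeeper's Watcher.Event.KeeperState (only the states discussed here).
    enum KeeperState { Disconnected, SyncConnected, Expired }

    // Hypothetical placeholder for the real ZooKeeper client handle.
    static final class FakeClient {
        final int sessionId;
        FakeClient(int sessionId) { this.sessionId = sessionId; }
    }

    private FakeClient zkClient = new FakeClient(1);
    private FakeClient oldZkClient;
    private int nextSessionId = 2;
    final List<String> trace = new ArrayList<>();

    void processEvent(KeeperState state) {
        switch (state) {
            case Disconnected:
                // Same pattern as in the issue description: the client reference is dropped.
                oldZkClient = zkClient;
                zkClient = null;
                break;
            case SyncConnected:
                // Assumed handling: the same session is still valid, so restore the old client.
                if (oldZkClient != null) {
                    zkClient = oldZkClient;
                    oldZkClient = null;
                }
                break;
            case Expired:
                // Assumed handling: the session is gone, so a new client/session is created.
                zkClient = new FakeClient(nextSessionId++);
                oldZkClient = null;
                break;
        }
        trace.add(state + " -> "
            + (zkClient == null ? "zkClient is null" : "session " + zkClient.sessionId));
    }

    boolean clientAvailable() {
        return zkClient != null;
    }

    // Stands in for any store operation that dereferences zkClient without a null check.
    int currentSessionId() {
        return zkClient.sessionId;
    }

    public static void main(String[] args) {
        ZkEventSketch store = new ZkEventSketch();
        store.processEvent(KeeperState.Disconnected);
        try {
            store.currentSessionId();
        } catch (NullPointerException e) {
            store.trace.add("store operation failed with NPE while disconnected");
        }
        store.processEvent(KeeperState.Expired);
        store.trace.forEach(System.out::println);
    }
}
```

A caller that first checks clientAvailable(), or waits until the next 
SyncConnected or Expired event has been processed, would avoid the null 
dereference in this sketch.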

> Potential NPE in ZKRMStateStore caused by handling Disconnected event from ZK.
> ------------------------------------------------------------------------------
>
>                 Key: YARN-1934
>                 URL: https://issues.apache.org/jira/browse/YARN-1934
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: resourcemanager
>    Affects Versions: 2.4.0
>            Reporter: Rohith
>            Priority: Critical
>         Attachments: RM.txt
>
>
> For the ZK Disconnected event, zkClient is set to null. This makes it very 
> prone to throwing an NPE.
> {noformat}
>         case Disconnected:
>           LOG.info("ZKRMStateStore Session disconnected");
>           oldZkClient = zkClient;
>           zkClient = null;
>           break;
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)
