[ 
https://issues.apache.org/jira/browse/YARN-4243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14950630#comment-14950630
 ] 

Rohith Sharma K S commented on YARN-4243:
-----------------------------------------

Thanks [~xgong] for working on this.
Some comments and suggestions
# While initializing Elector service createConnection will retry as per 
configured value i.e *maxRetryNum* say 10. But if session is closed and 
reestablished then number of retry count will be  *maxRetryNum* * *maxRetryNum* 
i.e 10*10=100 times.
# And method {{reEstablishSession()}} can be reused rather duplicating same 
logic over embedded electors.  Instead of overriding createConnection() method, 
reEstablishSession() method can be used in ActiveStandByElector constructor.I'd 
prefer to make change in hadoop-common rather in embedded elector service. 

> Add retry on establishing Zookeeper conenction in 
> EmbeddedElectorService#serviceInit
> ------------------------------------------------------------------------------------
>
>                 Key: YARN-4243
>                 URL: https://issues.apache.org/jira/browse/YARN-4243
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: resourcemanager
>            Reporter: Xuan Gong
>            Assignee: Xuan Gong
>         Attachments: YARN-4243.1.patch
>
>
> Right now, the RM would shut down if the zk connection is down when the RM do 
> the initialization. We need to add retry on this part



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to