[ 
https://issues.apache.org/jira/browse/YARN-1028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13823126#comment-13823126
 ] 

Karthik Kambatla commented on YARN-1028:
----------------------------------------

I propose the following:
# Introduce {[ipc.client.connect.retry.interval}} and set it to 1 second by 
default
# Introduce {{yarn.resourcemanager.ha.failover-interval-ms}} with a default 
value of 1 second to capture how soon to failover. (Should we reuse 
{{yarn.resourcemanager.connect.retry-interval.ms}} for this also?) Based on 
this and the value of {{ipc.client.connect.max.retries}}, dynamically set 
{{ipc.client.connect.retry.interval}}. For example, for the default values of 1 
second for failover-interval and 10 for max.retries, the 
{{ipc.client.connect.retry.interval}} becomes 100 ms. 
# The number of times to failover is to be bounded by 
{{yarn.resourcemanager.connect.max-wait.ms}}

[~bikassaha], what do you think? 


> Add FailoverProxyProvider like capability to RMProxy
> ----------------------------------------------------
>
>                 Key: YARN-1028
>                 URL: https://issues.apache.org/jira/browse/YARN-1028
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>            Reporter: Bikas Saha
>            Assignee: Karthik Kambatla
>         Attachments: yarn-1028-1.patch, yarn-1028-draft-cumulative.patch
>
>
> RMProxy layer currently abstracts RM discovery and implements it by looking 
> up service information from configuration. Motivated by HDFS and using 
> existing classes from Common, we can add failover proxy providers that may 
> provide RM discovery in extensible ways.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to