[
https://issues.apache.org/jira/browse/HADOOP-10127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13835212#comment-13835212
]
Karthik Kambatla commented on HADOOP-10127:
-------------------------------------------
[~sandyr] and [~stevel], thanks for the discussion. Agree with you both on how
to handle this.
I propose the following:
# Add ipc.client.connect.retry.interval in this JIRA, so the knob is available
to be played with in YARN.
# YARN-1028 would implement ConfiguredFailoverProxyProvider with the
not-so-nice 10 second delay to failover for all entities connecting to the RM.
# YARN-1460 to define yarn-specific ipc-client configs that the clients use,
and add an exponential backoff and jitter at least for NM -> RM.
With that plan, what do you think of the current patch?
> Add ipc.client.connect.retry.interval to control the frequency of connection
> retries
> ------------------------------------------------------------------------------------
>
> Key: HADOOP-10127
> URL: https://issues.apache.org/jira/browse/HADOOP-10127
> Project: Hadoop Common
> Issue Type: Bug
> Components: ipc
> Affects Versions: 2.2.0
> Reporter: Karthik Kambatla
> Assignee: Karthik Kambatla
> Attachments: hadoop-10127-1.patch
>
>
> Currently, {{ipc.Client}} client attempts to connect to the server every 1
> second. It would be nice to make this configurable to be able to connect
> more/less frequently. Changing the number of retries alone is not granular
> enough.
--
This message was sent by Atlassian JIRA
(v6.1#6144)