Xiao Chen created HADOOP-15321:
----------------------------------
Summary: Reduce the RPC Client max retries on timeouts
Key: HADOOP-15321
URL: https://issues.apache.org/jira/browse/HADOOP-15321
Project: Hadoop Common
Issue Type: Improvement
Components: ipc
Reporter: Xiao Chen
Assignee: Xiao Chen
Currently, the
[default|https://github.com/apache/hadoop/blob/branch-3.0.0/hadoop-common-project/hadoop-common/src/main/java/org/apache/hadoop/fs/CommonConfigurationKeysPublic.java#L379]
number of retries when IPC client catch a {{ConnectTimeoutException}} is 45.
This seems unreasonably high.
Given the IPC client timeout is by default 60 seconds, if a DN host is shutdown
the client will retry for 45 minutes until aborting. (If host is there but
process down, it would throw a connection refused immediately, which is cool)
Creating this Jira to discuss whether we can reduce that to a reasonable number.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]