[
https://issues.apache.org/jira/browse/HADOOP-7086?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13197966#comment-13197966
]
Robert Joseph Evans commented on HADOOP-7086:
---------------------------------------------
I did not see HADOOP-7932 before. It does cover the exact same issue.
@Devaraj, the other patch
https://issues.apache.org/jira/secure/attachment/12507977/HADOOP-7932.patch
looks very similar to your patch. Perhaps we should just merge it to trunk if
you want the fix right away; otherwise you can wait for the HA branch to be
merged back into trunk, and we mark this issue as a duplicate of HADOOP-7932.
> Retrying socket connection failure times can be made as configurable
> --------------------------------------------------------------------
>
> Key: HADOOP-7086
> URL: https://issues.apache.org/jira/browse/HADOOP-7086
> Project: Hadoop Common
> Issue Type: Improvement
> Components: conf
> Environment: NA
> Reporter: Devaraj K
> Assignee: Devaraj K
> Priority: Minor
> Fix For: 0.24.0
>
> Attachments: HADOOP-7086-1.patch, HADOOP-7086.patch, common-3899.patch
>
> Original Estimate: 72h
> Remaining Estimate: 72h
>
> The number of retries after a socket connection timeout is hard-coded as 45,
> so the client logs the retrying message up to 45 times, as below.
> 2011-01-04 15:14:30,700 INFO ipc.Client
> (Client.java:handleConnectionFailure(487)) - Retrying connect to server:
> /10.18.52.124:50020. Already tried 1 time(s).
> This could be made configurable, keeping 45 as the default value. Users who
> want fewer or more retries can set the new property; otherwise the client
> continues with the default.
> common\src\java\org\apache\hadoop\ipc\Client.java:
> -----------------------------------------------------------------------
> private synchronized void setupConnection() throws IOException {
>   short ioFailures = 0;
>   short timeoutFailures = 0;
>   while (true) {
>     try {
>       this.socket = socketFactory.createSocket();
>       this.socket.setTcpNoDelay(tcpNoDelay);
>       // connection time out is 20s
>       NetUtils.connect(this.socket, remoteId.getAddress(), 20000);
>       if (rpcTimeout > 0) {
>         pingInterval = rpcTimeout; // rpcTimeout overwrites pingInterval
>       }
>       this.socket.setSoTimeout(pingInterval);
>       return;
>     } catch (SocketTimeoutException toe) {
>       /*
>        * The max number of retries is 45, which amounts to 20s*45 = 15
>        * minutes retries.
>        */
>       handleConnectionFailure(timeoutFailures++, 45, toe);
>     } catch (IOException ie) {
>       handleConnectionFailure(ioFailures++, maxRetries, ie);
>     }
>   }
> }
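
For illustration only, the proposal in the description (read the timeout retry
limit from configuration, fall back to 45) could look roughly like the sketch
below. It is not taken from either attached patch. The class RetryConfig, its
method names, and the property key are assumptions made for this example, and
java.util.Properties stands in for Hadoop's Configuration so that the sketch
compiles on its own.

```java
import java.util.Properties;

// Hypothetical helper, not part of Hadoop: resolves the retry limit that
// setupConnection() currently hard-codes as 45.
class RetryConfig {
    // Assumed property name, used here for illustration only.
    static final String MAX_RETRIES_ON_TIMEOUT_KEY =
        "ipc.client.connect.max.retries.on.timeouts";
    // Default taken from the hard-coded value in the quoted code.
    static final int DEFAULT_MAX_RETRIES_ON_TIMEOUT = 45;
    // Connection timeout per attempt, 20s in the quoted code.
    static final int CONNECT_TIMEOUT_MS = 20000;

    // Returns the configured limit, or the default when the property is
    // missing, not a number, or negative.
    static int maxRetriesOnTimeout(Properties conf) {
        String raw = conf.getProperty(MAX_RETRIES_ON_TIMEOUT_KEY);
        if (raw == null) {
            return DEFAULT_MAX_RETRIES_ON_TIMEOUT;
        }
        try {
            int value = Integer.parseInt(raw.trim());
            return value >= 0 ? value : DEFAULT_MAX_RETRIES_ON_TIMEOUT;
        } catch (NumberFormatException e) {
            return DEFAULT_MAX_RETRIES_ON_TIMEOUT;
        }
    }

    // Time spent in connect timeouts for the given limit, following the
    // 20s * retries arithmetic in the quoted comment and ignoring any
    // pause between attempts.
    static long worstCaseWaitMs(int maxRetries) {
        return (long) maxRetries * CONNECT_TIMEOUT_MS;
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        int byDefault = maxRetriesOnTimeout(conf);
        System.out.println(byDefault + " retries, "
            + worstCaseWaitMs(byDefault) / 60000 + " min");

        conf.setProperty(MAX_RETRIES_ON_TIMEOUT_KEY, "10");
        int lowered = maxRetriesOnTimeout(conf);
        System.out.println(lowered + " retries, "
            + worstCaseWaitMs(lowered) / 1000 + " s");
    }
}
```

With a helper like this, the SocketTimeoutException branch would pass the
resolved value to handleConnectionFailure in place of the literal 45, much as
the IOException branch already passes maxRetries.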