Explicit timeout for ipc.Client
-------------------------------

                 Key: HADOOP-642
                 URL: http://issues.apache.org/jira/browse/HADOOP-642
             Project: Hadoop
          Issue Type: Bug
    Affects Versions: 0.7.2
            Reporter: Konstantin Shvachko


This bug contributed to the crash discussed in HADOOP-572.
ipc.Client is trying to establish connection with its server with an infinite 
timeout.
For an unknown to me reason infinity equals 3 minutes in this case.
I guess it is configured somewhere in the native socket implementation.
With this timeout data-nodes had only 3 chances to send heartbeats during the 10
minute expiration interval. And with a very busy name-node this makes their
chances to be accepted close to 0.

I included an explicit call of Socket.connect() with a timeout set to 1 min, 
which is
our default for all connections.
Modified a log message to include information that turned out to be useful for 
debugging.
Removed unnecessary imports.


-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators: 
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to