[ 
https://issues.apache.org/jira/browse/HDDS-2107?focusedWorklogId=310907&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-310907
 ]

ASF GitHub Bot logged work on HDDS-2107:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 11/Sep/19 18:20
            Start Date: 11/Sep/19 18:20
    Worklog Time Spent: 10m 
      Work Description: vivekratnavel commented on issue #1424: HDDS-2107. 
Datanodes should retry forever to connect to SCM in an…
URL: https://github.com/apache/hadoop/pull/1424#issuecomment-530503260
 
 
   @adoroszlai You are right. With this change, we no longer get the error from 
`EndPointStateMachine`, and the result now looks like this:
   ```
   datanode_1  | 2019-09-11 18:16:55 INFO  InitDatanodeState:140 - 
DatanodeDetails is persisted to /data/datanode.id
   datanode_1  | 2019-09-11 18:16:57 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 0 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:16:58 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 1 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:16:59 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 2 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:00 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 3 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:01 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 4 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:02 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 5 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:03 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 6 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:04 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 7 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:05 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 8 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:06 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 9 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:07 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 10 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:08 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 11 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:09 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 12 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:10 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 13 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:11 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 14 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:12 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 15 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:13 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 16 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:14 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 17 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:15 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 18 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:16 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 19 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:17 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 20 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:18 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 21 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:19 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 22 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:20 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 23 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:21 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 24 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:22 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 25 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:23 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 26 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:24 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 27 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:25 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 28 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:26 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 29 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:27 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 30 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:28 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 31 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:29 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 32 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:30 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 33 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:31 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 34 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:32 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 35 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:33 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 36 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:34 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 37 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:35 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 38 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:36 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 39 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:37 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 40 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:38 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 41 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:39 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 42 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:40 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 43 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:41 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 44 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:43 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 45 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:44 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 46 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:45 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 47 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:45 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 48 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:46 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 49 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:47 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 50 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:48 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 51 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:49 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 52 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:50 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 53 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:51 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 54 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:52 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 55 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:53 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 56 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:54 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 57 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:55 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 58 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:56 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 59 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:58 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 60 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:17:59 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 61 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:00 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 62 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:01 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 63 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:02 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 64 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:03 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 65 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:04 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 66 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:05 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 67 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:06 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 68 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:07 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 69 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:08 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 70 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:09 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 71 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:10 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 72 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:11 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 73 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:12 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 74 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:13 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 75 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:14 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 76 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:15 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 77 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   datanode_1  | 2019-09-11 18:18:16 INFO  Client:948 - Retrying connect to 
server: datanode/172.19.0.2:9861. Already tried 78 time(s); retry policy is 
RetryUpToMaximumCountWithFixedSleep(maxRetries=2147483647, sleepTime=1000 
MILLISECONDS)
   ```
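
The behaviour visible in the log above (a fixed sleep between attempts, with `maxRetries` equal to `Integer.MAX_VALUE`, i.e. 2147483647) can be sketched without any Hadoop dependency. This is only an illustration under stated assumptions, not the code from the pull request: `FixedSleepRetry` and `callWithRetry` are hypothetical names, and the failing connection is simulated by a lambda.

```java
import java.net.ConnectException;
import java.util.concurrent.Callable;
import java.util.concurrent.TimeUnit;

// Hypothetical helper, not part of Hadoop or Ozone.
public class FixedSleepRetry {

    /**
     * Runs the task until it succeeds. After each failure it sleeps for a
     * fixed time and tries again, rethrowing the last exception once
     * maxRetries retries have been used up.
     */
    public static <T> T callWithRetry(Callable<T> task, int maxRetries,
                                      long sleepTime, TimeUnit unit) throws Exception {
        int retries = 0;
        while (true) {
            try {
                return task.call();
            } catch (Exception e) {
                if (retries >= maxRetries) {
                    throw e; // retry budget exhausted
                }
                System.out.printf("Already tried %d time(s); sleeping %d %s%n",
                        retries, sleepTime, unit);
                retries++;
                unit.sleep(sleepTime);
            }
        }
    }

    public static void main(String[] args) throws Exception {
        int[] attempts = {0};
        // Simulated endpoint: refuses the first three attempts, then accepts.
        String result = callWithRetry(() -> {
            if (++attempts[0] < 4) {
                throw new ConnectException("Connection refused");
            }
            return "connected";
        }, Integer.MAX_VALUE, 10, TimeUnit.MILLISECONDS);
        System.out.println(result + " after " + attempts[0] + " attempts");
    }
}
```

With `maxRetries` set to `Integer.MAX_VALUE` and a one-second sleep, the retry budget would take roughly 68 years to exhaust, which is why it behaves as "retry forever" in practice. Passing 10 instead reproduces the bounded behaviour described in the issue.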
    
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

    Worklog Id:     (was: 310907)
    Time Spent: 50m  (was: 40m)

> Datanodes should retry forever to connect to SCM in an unsecure environment
> ---------------------------------------------------------------------------
>
>                 Key: HDDS-2107
>                 URL: https://issues.apache.org/jira/browse/HDDS-2107
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: Ozone Datanode
>    Affects Versions: 0.4.1
>            Reporter: Vivek Ratnavel Subramanian
>            Assignee: Vivek Ratnavel Subramanian
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 50m
>  Remaining Estimate: 0h
>
> In an unsecure environment, the datanodes retry up to 10 times, waiting 
> 1000 milliseconds between attempts, before throwing this error:
> {code:java}
> Unable to communicate to SCM server at scm:9861 for past 0 seconds.
> java.net.ConnectException: Call From scm/10.65.36.118 to scm:9861 failed on 
> connection exception: java.net.ConnectException: Connection refused; For more 
> details see:  http://wiki.apache.org/hadoop/ConnectionRefused
>       at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
>       at 
> sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:62)
>       at 
> sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:45)
>       at java.lang.reflect.Constructor.newInstance(Constructor.java:423)
>       at org.apache.hadoop.net.NetUtils.wrapWithMessage(NetUtils.java:831)
>       at org.apache.hadoop.net.NetUtils.wrapException(NetUtils.java:755)
>       at org.apache.hadoop.ipc.Client.getRpcResponse(Client.java:1515)
>       at org.apache.hadoop.ipc.Client.call(Client.java:1457)
>       at org.apache.hadoop.ipc.Client.call(Client.java:1367)
>       at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:228)
>       at 
> org.apache.hadoop.ipc.ProtobufRpcEngine$Invoker.invoke(ProtobufRpcEngine.java:116)
>       at com.sun.proxy.$Proxy33.getVersion(Unknown Source)
>       at 
> org.apache.hadoop.ozone.protocolPB.StorageContainerDatanodeProtocolClientSideTranslatorPB.getVersion(StorageContainerDatanodeProtocolClientSideTranslatorPB.java:112)
>       at 
> org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:70)
>       at 
> org.apache.hadoop.ozone.container.common.states.endpoint.VersionEndpointTask.call(VersionEndpointTask.java:42)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>       at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
>       at java.util.concurrent.FutureTask.run(FutureTask.java:266)
>       at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
>       at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
>       at java.lang.Thread.run(Thread.java:748)
> Caused by: java.net.ConnectException: Connection refused
>       at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
>       at 
> sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:717)
>       at 
> org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
>       at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:531)
>       at 
> org.apache.hadoop.ipc.Client$Connection.setupConnection(Client.java:690)
>       at 
> org.apache.hadoop.ipc.Client$Connection.setupIOstreams(Client.java:794)
>       at org.apache.hadoop.ipc.Client$Connection.access$3700(Client.java:411)
>       at org.apache.hadoop.ipc.Client.getConnection(Client.java:1572)
>       at org.apache.hadoop.ipc.Client.call(Client.java:1403)
>       ... 13 more
> {code}
> The datanodes should retry forever to connect to SCM instead of giving up 
> after 10 retries.



--
This message was sent by Atlassian Jira
(v8.3.2#803003)

