sodonnel commented on PR #3977: URL: https://github.com/apache/ozone/pull/3977#issuecomment-1322422446
What happens with this change if a node is down and the call fails only after something like a connection timeout, rather than immediately? For example, if the call fails after 60 seconds, will it retry 5 times, hanging for 60 seconds each time? If the DN process is down but the host is up, the client will usually get a connection refused immediately. Will that still be retried 5 times? I guess we want to retry on intermittent failures (e.g. a GC pause), but when a host or process is actually down, this may lengthen the time it takes to surface a real failure. I am not sure what the best way to go on this is; there probably isn't a perfect solution.
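To make the concern concrete, here is a rough sketch of one possible compromise: fail fast on connection refused, and cap the retries with an overall time budget so that a slow timeout cannot be multiplied by the attempt count. This is not the code in this PR and not an existing Ozone API. `RetrySketch`, `isRetriable`, `callWithRetry` and both parameters are hypothetical names, and the classification rule is only an assumption about what might count as intermittent.

```java
import java.io.IOException;
import java.net.ConnectException;
import java.util.concurrent.Callable;

// Hypothetical sketch only; none of these names exist in Ozone.
class RetrySketch {

  // Assumption: "connection refused" usually means the process is down,
  // so retrying is unlikely to help. Other IOExceptions (including
  // SocketTimeoutException) are treated as possibly intermittent.
  static boolean isRetriable(Exception e) {
    if (e instanceof ConnectException) {
      return false;
    }
    return e instanceof IOException;
  }

  // Retries up to maxAttempts times, but stops early once the total
  // elapsed time exceeds totalBudgetMillis. A single 60 second timeout
  // therefore uses up a 60 second budget and is not retried again.
  static <T> T callWithRetry(Callable<T> call, int maxAttempts,
      long totalBudgetMillis) throws Exception {
    if (maxAttempts < 1) {
      throw new IllegalArgumentException("maxAttempts must be >= 1");
    }
    long deadline = System.nanoTime() + totalBudgetMillis * 1_000_000L;
    Exception last = null;
    for (int attempt = 1; attempt <= maxAttempts; attempt++) {
      try {
        return call.call();
      } catch (Exception e) {
        last = e;
        boolean budgetSpent = System.nanoTime() - deadline >= 0;
        if (!isRetriable(e) || budgetSpent) {
          break; // surface the real failure without further attempts
        }
      }
    }
    throw last;
  }
}
```

With something along these lines, a refused connection is reported after one attempt, a short blip such as a GC pause still gets its retries, and a hung call costs at most one timeout plus whatever budget remains. Whether a timeout should be retried at all is still a judgment call.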
