aswinshakil commented on code in PR #3977:
URL: https://github.com/apache/ozone/pull/3977#discussion_r1029704987
##########
hadoop-hdds/common/src/main/resources/ozone-default.xml:
##########
@@ -3275,6 +3275,24 @@
</description>
</property>
+ <property>
+ <name>ozone.client.ec.grpc.retries.enabled</name>
+ <value>true</value>
+ <tag>CLIENT</tag>
+ <description>
+ To enable Grpc client retries for EC.
+ </description>
+ </property>
+
+ <property>
Review Comment:
The client timeout is specified by `ozone.client.read.timeout` which is by
default `30s`, After which GRPC throws `DEADLINE_EXCEEDED`. Under heavy load,
If we assume we get the same error for the 5 retries, then the wait time would
be `5 * 30s = 150s`.
One solution is that we can make the retry count to 3 (which also includes
transient retries grpc tries on its own) so we don't try as much because we
don't want to burden the DN that much as well. Or we can also try [throttling
the
retries](https://github.com/grpc/proposal/blob/master/A6-client-retries.md#throttling-retry-attempts-and-hedged-rpcs)
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]