aswinshakil commented on code in PR #3977:
URL: https://github.com/apache/ozone/pull/3977#discussion_r1029704987


##########
hadoop-hdds/common/src/main/resources/ozone-default.xml:
##########
@@ -3275,6 +3275,24 @@
     </description>
   </property>
 
+  <property>
+    <name>ozone.client.ec.grpc.retries.enabled</name>
+    <value>true</value>
+    <tag>CLIENT</tag>
+    <description>
+      To enable Grpc client retries for EC.
+    </description>
+  </property>
+
+  <property>

Review Comment:
   The client timeout is set by `ozone.client.read.timeout`, which defaults to 
`30s`; after that, gRPC throws `DEADLINE_EXCEEDED`. Under heavy load, if we 
assume every one of the 5 retries hits the same error, the total wait time 
would be `5 * 30s = 150s`. 
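
   To make the arithmetic concrete, here is a minimal sketch. The class and 
method names are hypothetical (not from Ozone), and it assumes every try runs 
until the full deadline with no backoff in between:

```java
import java.time.Duration;

class RetryWait {
    // Worst case: every try runs into the deadline before failing
    // with DEADLINE_EXCEEDED, so the waits simply add up.
    static Duration worstCaseWait(int tries, Duration perTryTimeout) {
        return perTryTimeout.multipliedBy(tries);
    }

    public static void main(String[] args) {
        // 30s is the default of ozone.client.read.timeout quoted above.
        Duration timeout = Duration.ofSeconds(30);
        System.out.println(worstCaseWait(5, timeout).getSeconds()); // prints 150
        System.out.println(worstCaseWait(3, timeout).getSeconds()); // prints 90
    }
}
```

   With a count of 3 the same worst case drops to `90s`.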
   
   One solution is to reduce the retry count to 3 (which also includes the 
transient retries gRPC performs on its own), so that we retry less and don't 
put extra load on the DN. Alternatively, we could try [throttling the 
retries](https://github.com/grpc/proposal/blob/master/A6-client-retries.md#throttling-retry-attempts-and-hedged-rpcs)
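
   For reference, the throttling described in that proposal is a token bucket: 
the count starts at `maxTokens`, each failed RPC subtracts 1, each successful 
RPC adds `tokenRatio`, and retries are allowed only while the count is above 
`maxTokens / 2`. A rough sketch of that rule follows; the `RetryThrottle` 
class is hypothetical and only illustrates the idea. In gRPC itself this would 
be set through the `retryThrottling` section of the service config rather than 
hand-written:

```java
class RetryThrottle {
    private final double maxTokens;
    private final double tokenRatio;
    private double tokens;

    RetryThrottle(double maxTokens, double tokenRatio) {
        this.maxTokens = maxTokens;
        this.tokenRatio = tokenRatio;
        this.tokens = maxTokens; // the bucket starts full
    }

    // A failed RPC costs one token, never going below zero.
    synchronized void onFailure() {
        tokens = Math.max(0, tokens - 1);
    }

    // A successful RPC earns back tokenRatio tokens, capped at maxTokens.
    synchronized void onSuccess() {
        tokens = Math.min(maxTokens, tokens + tokenRatio);
    }

    // Retries are allowed only while more than half the bucket remains.
    synchronized boolean retryAllowed() {
        return tokens > maxTokens / 2;
    }
}
```

   With `maxTokens = 10`, five consecutive failures are enough to stop further 
retries until some calls succeed again, which is what would protect an 
overloaded DN.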



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

