[ 
https://issues.apache.org/jira/browse/HDDS-1836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16890636#comment-16890636
 ] 

Shashikant Banerjee edited comment on HDDS-1836 at 7/23/19 3:02 AM:
--------------------------------------------------------------------

[~arp], consider a test environment with a limited number of datanodes (DNs) 
in which one DN instance cannot connect to its peers. In that case too, leader 
election takes much longer to fail or time out, so the DNs are not freed up to 
join a different or new pipeline. In such setups pipeline creation can be 
delayed or become impossible.

Also, we have a client request timeout of 3s by default. If the leader 
election min timeout is considerably larger than the request timeout, we can 
see multiple request timeouts within a single leader election window.
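
To make the arithmetic concrete, here is a rough sketch (not Ozone or Ratis 
code) that counts how many whole client request timeouts fit inside one leader 
election timeout window. The class and method names are hypothetical, the 10s 
value is only an illustration, and the model assumes the client retries 
back-to-back with no backoff, starting at the same instant as the election.

```java
import java.time.Duration;

public class ElectionWindow {

    /**
     * Number of whole request timeouts that can elapse within one election
     * timeout window. Simplified model: retries are issued immediately after
     * each timeout, and the first request starts when the election starts.
     */
    static long requestTimeoutsPerElection(Duration electionMinTimeout,
                                           Duration requestTimeout) {
        if (requestTimeout.isZero() || requestTimeout.isNegative()) {
            throw new IllegalArgumentException("requestTimeout must be positive");
        }
        return electionMinTimeout.toMillis() / requestTimeout.toMillis();
    }

    public static void main(String[] args) {
        // 3s is the default client request timeout mentioned in this comment.
        Duration requestTimeout = Duration.ofSeconds(3);
        // 2s = proposed default, 5s = current default, 10s = hypothetical value.
        for (long seconds : new long[] {2, 5, 10}) {
            Duration election = Duration.ofSeconds(seconds);
            System.out.println("election min timeout " + seconds + "s: "
                + requestTimeoutsPerElection(election, requestTimeout)
                + " request timeout(s) per election window");
        }
    }
}
```

Under these assumptions, the proposed 2s value lets the election window end 
before the first client request times out, while larger values let one or more 
request timeouts accumulate inside a single window.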


> Change the default value of ratis leader election min timeout to a lower value
> ------------------------------------------------------------------------------
>
>                 Key: HDDS-1836
>                 URL: https://issues.apache.org/jira/browse/HDDS-1836
>             Project: Hadoop Distributed Data Store
>          Issue Type: Bug
>          Components: Ozone Datanode
>    Affects Versions: 0.5.0
>            Reporter: Shashikant Banerjee
>            Assignee: Shashikant Banerjee
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.5.0
>
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> The default min leader election timeout is currently 5s (set in 
> HDDS-1718). This makes leader election take much longer to time out when 
> there are network failures, which delays pipeline creation in the system. 
> The idea is to change the default to a lower value of "2s" for now.



