[ 
https://issues.apache.org/jira/browse/FLINK-23403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17463077#comment-17463077
 ] 

Till Rohrmann commented on FLINK-23403:
---------------------------------------

I agree that the change might not work for every deployment. Large-scale 
deployments will probably have to set a different heartbeat interval and 
timeout.
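
As a rough sketch of such an override, assuming the `heartbeat.interval` and 
`heartbeat.timeout` options in flink-conf.yaml (both given in milliseconds): 
the fragment below simply restores the previous 10s/50s defaults mentioned in 
the issue. The values are illustrative only, not a recommendation for any 
particular cluster size.

```yaml
# flink-conf.yaml (illustrative values, not a tuning recommendation)
# Interval between heartbeat requests, in milliseconds (previous default: 10s).
heartbeat.interval: 10000
# Time without a heartbeat before the peer is considered failed,
# in milliseconds (previous default: 50s).
heartbeat.timeout: 50000
```

Both the old and the proposed defaults keep the timeout at five times the 
interval (50s/10s and 15s/3s), so a custom pair would presumably want to keep 
the timeout several intervals long as well.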

Adjusting the heartbeat settings dynamically might also work, but it is more 
complex and more error-prone. Maybe we can try this as a follow-up.

> Decrease default values for heartbeat timeout and interval
> ----------------------------------------------------------
>
>                 Key: FLINK-23403
>                 URL: https://issues.apache.org/jira/browse/FLINK-23403
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Configuration, Runtime / Coordination
>    Affects Versions: 1.14.0
>            Reporter: Till Rohrmann
>            Assignee: Till Rohrmann
>            Priority: Major
>              Labels: pull-request-available, stale-assigned
>             Fix For: 1.15.0
>
>
> In order to speed up failure detection, I suggest decreasing the default 
> values for the heartbeat timeout and interval from 50s/10s to 15s/3s.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
