[ 
https://issues.apache.org/jira/browse/FLINK-32895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17756636#comment-17756636
 ] 

Rui Fan commented on FLINK-32895:
---------------------------------

Hi [~zhuzh][~wanglijie], would you mind taking a look at this JIRA in your 
free time? Thanks~

BTW, if the improvement is reasonable, it will add a new option and a new 
`exponentialDelayRestart` method in `RestartStrategies` (a PublicEvolving 
class). It's a small feature.

I'm not sure whether a FLIP is necessary whenever we add an option or change 
a public class, even for a small feature. If yes, I can start a FLIP; if 
no, I will follow up here.
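
To make the proposed semantics concrete, here is a minimal sketch of how the 
new limit could interact with the existing backoff and reset behaviour. It is 
not Flink's implementation: the class `ExponentialDelaySketch` and all of its 
method and field names are hypothetical, jitter is left out, and the reset 
rule (the counter clears once the gap between two failures reaches the reset 
threshold) is an assumption made for illustration.

```java
/** Hypothetical sketch of exponential delay with a max-attempts limit; not Flink code. */
final class ExponentialDelaySketch {
    private final long initialBackoffMs;
    private final long maxBackoffMs;
    private final double backoffMultiplier;
    private final long resetThresholdMs;
    private final int maxAttemptsBeforeReset; // the proposed new option

    private long currentBackoffMs;
    private int failuresSinceReset = 0;
    private long lastFailureMs = 0L;

    ExponentialDelaySketch(
            long initialBackoffMs,
            long maxBackoffMs,
            double backoffMultiplier,
            long resetThresholdMs,
            int maxAttemptsBeforeReset) {
        this.initialBackoffMs = initialBackoffMs;
        this.maxBackoffMs = maxBackoffMs;
        this.backoffMultiplier = backoffMultiplier;
        this.resetThresholdMs = resetThresholdMs;
        this.maxAttemptsBeforeReset = maxAttemptsBeforeReset;
        this.currentBackoffMs = initialBackoffMs;
    }

    /** Records a job failure observed at the given timestamp (milliseconds). */
    void notifyFailure(long nowMs) {
        boolean quietLongEnough =
                failuresSinceReset > 0 && nowMs - lastFailureMs >= resetThresholdMs;
        if (quietLongEnough) {
            // Assumed reset rule: a long enough gap clears backoff and counter.
            currentBackoffMs = initialBackoffMs;
            failuresSinceReset = 0;
        } else if (failuresSinceReset > 0) {
            // Consecutive failure: grow the delay, capped at the maximum.
            currentBackoffMs =
                    Math.min((long) (currentBackoffMs * backoffMultiplier), maxBackoffMs);
        }
        failuresSinceReset++;
        lastFailureMs = nowMs;
    }

    /** The job may restart only while the failure count stays within the limit. */
    boolean canRestart() {
        return failuresSinceReset <= maxAttemptsBeforeReset;
    }

    long getBackoffMs() {
        return currentBackoffMs;
    }
}
```

With a limit of 3, the first three failures inside one reset window still 
allow a restart (with growing delay), and the fourth makes `canRestart()` 
return false, so the job would go to the failed state.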

> Introduce the max attempts for Exponential Delay Restart Strategy
> -----------------------------------------------------------------
>
>                 Key: FLINK-32895
>                 URL: https://issues.apache.org/jira/browse/FLINK-32895
>             Project: Flink
>          Issue Type: Improvement
>          Components: Runtime / Coordination
>            Reporter: Rui Fan
>            Assignee: Rui Fan
>            Priority: Major
>
> Currently, Flink has 3 restart strategies: fixed-delay, 
> failure-rate and exponential-delay.
> The exponential-delay strategy is suitable if a job continues to fail for a 
> period of time. The fixed-delay and failure-rate strategies have a max 
> attempts mechanism: the job stops restarting and fails once the number of 
> attempts exceeds the max attempts threshold. 
> The max attempts mechanism is reasonable: Flink should not, and does not 
> need to, restart the job infinitely if it keeps failing. However, the 
> exponential-delay strategy doesn't have a max attempts mechanism.
> I propose introducing the 
> `restart-strategy.exponential-delay.max-attempts-before-reset` option to 
> support the max attempts mechanism for exponential-delay. It means Flink 
> won't restart the job if the number of job failures before reset exceeds 
> max-attempts-before-reset when exponential-delay is enabled.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
