[
https://issues.apache.org/jira/browse/FLINK-32895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17756636#comment-17756636
]
Rui Fan commented on FLINK-32895:
---------------------------------
Hi [~zhuzh][~wanglijie], would you mind helping take a look this JIRA in your
free time? thanks~
BTW, if the improvement is reasonable, it will add a new option and a new
`exponentialDelayRestart` method in `RestartStrategies` (It's a PublicEvolving
class), it's a small feature.
I'm not sure whether the FLIP is necessary when adding any options or changing
any public classes even if it's a small feature. If yes, I can start a FLIP, if
no, I will follow it here.
> Introduce the max attempts for Exponential Delay Restart Strategy
> -----------------------------------------------------------------
>
> Key: FLINK-32895
> URL: https://issues.apache.org/jira/browse/FLINK-32895
> Project: Flink
> Issue Type: Improvement
> Components: Runtime / Coordination
> Reporter: Rui Fan
> Assignee: Rui Fan
> Priority: Major
>
> Currently, Flink has 3 restart strategies, they are: fixed-delay,
> failure-rate and exponential-delay.
> The exponential-delay is suitable if a job continues to fail for a period of
> time. The fixed-delay and failure-rate has the max attempts mechanism, that
> means, the job won't restart and go to fail after the attempt exceeds the
> threshold of max attemepts.
> The max attempts mechanism is reasonable, flink should not or need to
> infinitely restart the job if the job keeps failing. However, the
> exponential-delay doesn't have the max attempts mechanism.
> I propose inctroducing the
> `restart-strategy.exponential-delay.max-attempts-before-reset` to support the
> max attempts mechanism for exponential-delay. It means flink won't restart
> job if the number of job failures before reset exceeds
> max-attempts-before-reset when is exponential-delay is enabled.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)