[
https://issues.apache.org/jira/browse/FLINK-17871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Konstantin Knauf reopened FLINK-17871:
--------------------------------------
Re-opening in accordance with https://issues.apache.org/jira/browse/FLINK-23206.
> Make the default value of attemptFailuresValidityInterval more reasonable
> -------------------------------------------------------------------------
>
> Key: FLINK-17871
> URL: https://issues.apache.org/jira/browse/FLINK-17871
> Project: Flink
> Issue Type: Improvement
> Components: Deployment / YARN
> Reporter: fanxin
> Priority: Minor
> Labels: auto-closed
>
> Default value of `yarn.application-attempt-failures-validity-interval` is
> `10000` milliseconds at present. Usually preparing the context alone can take
> seconds, which means that default value of 10000 is too small to even prepare
> the runtime context. With a default config, a flink on yarn job in will
> hardly meet the condition of ”fail 2 times in 10s“. If the job has some
> internal problems, unfortunately, it can easily get bogged down in endless
> retries.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)