fanxin created FLINK-17871:
------------------------------

             Summary: Make the default value of attemptFailuresValidityInterval 
more reasonable
                 Key: FLINK-17871
                 URL: https://issues.apache.org/jira/browse/FLINK-17871
             Project: Flink
          Issue Type: Improvement
          Components: Deployment / YARN
            Reporter: fanxin


Default value of `yarn.application-attempt-failures-validity-interval` is 
`10000` milliseconds at present. Usually preparing the context alone can take 
seconds, which means that default value 10000 is too small even to ready a 
runtime context. With a default config, a flink on yarn job in will hardly meet 
the condition of ”fail 2 times in 10s“. If the job has some internal problems, 
unfortunately, it can easily get bogged down in endless retries.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to