[ https://issues.apache.org/jira/browse/TWILL-181?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15840301#comment-15840301 ]
ASF GitHub Bot commented on TWILL-181: -------------------------------------- Github user poornachandra commented on a diff in the pull request: https://github.com/apache/twill/pull/23#discussion_r98075925 --- Diff: twill-yarn/src/main/java/org/apache/twill/internal/appmaster/RunningContainers.java --- @@ -113,9 +117,11 @@ public Integer apply(BitSet input) { private final Location applicationLocation; private final Set<String> runnableNames; private final Map<String, Map<String, String>> logLevels; + private final Map<String, Integer> maxRetries; --- End diff -- On further thought, I think if we track the restarts per instance id then it would simplify the contract for max retries. This will be consistent with what the javadoc says too - ``` Sets the maximum number of times (per instance) a runnable will be retried if it exits without success. The default behavior is to retry indefinitely. ``` > Control the maximum number of retries for failed application starts > ------------------------------------------------------------------- > > Key: TWILL-181 > URL: https://issues.apache.org/jira/browse/TWILL-181 > Project: Apache Twill > Issue Type: Improvement > Components: yarn > Affects Versions: 0.7.0-incubating > Reporter: Martin Serrano > Assignee: Martin Serrano > Fix For: 0.10.0 > > > If an application consistently exits with a non-zero code, twill will > attempt to restart indefinitely. I ran into this issue and a list search > also reveals [others| http://markmail.org/message/dehx7r6tpqgcmjh4]. > There should be a mechanism to specify the maximum number of retries until > the application fails. Ideally by default there would be a non-infinite > maximum. -- This message was sent by Atlassian JIRA (v6.3.4#6332)