Henry Saputra created TWILL-132:
-----------------------------------

             Summary: Twill AppMaster does not restart on failure as many times 
specified in yarn.resourcemanager.am.max-attempts
                 Key: TWILL-132
                 URL: https://issues.apache.org/jira/browse/TWILL-132
             Project: Apache Twill
          Issue Type: Bug
          Components: yarn
    Affects Versions: 0.5.0-incubating
            Reporter: Henry Saputra


The Apache Hadoop YARN has config property that allow restart on AM due to 
failures certain amount of times. It is specified by 
yarn.resourcemanager.am.max-attempts (default is 2).

So, this config parameter allow AM to have HA like behavior to resilient to 
failures up to certain number of times.

The Twill AppMaster seems to have problem restarting when failed due to 
uncertain condition (e.g.: kill signal)

<code>
    <name>yarn.resourcemanager.am.max-attempts</name>
    <value>2</value>
  </property>
</code>



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to