Sunil G created YARN-2293:
-----------------------------

             Summary: Scoring for NMs to identify a better candidate to launch 
AMs
                 Key: YARN-2293
                 URL: https://issues.apache.org/jira/browse/YARN-2293
             Project: Hadoop YARN
          Issue Type: Improvement
          Components: nodemanager, resourcemanager
            Reporter: Sunil G
            Assignee: Sunil G


Container exit status from NM is giving indications of reasons for its failure. 
Some times, it may be because of container launching problems in NM. In a 
heterogeneous cluster, some machines with weak hardware may cause more 
failures. It will be better not to launch AMs there more often. Also I would 
like to clear that container failures because of buggy job should not result in 
decreasing score. 
As mentioned earlier, based on exit status if a scoring mechanism is added for 
NMs in RM, then NMs with better scores can be given for launching AMs. Thoughts?




--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to