[ https://issues.apache.org/jira/browse/YARN-4217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14940072#comment-14940072 ]
Varun Vasudev commented on YARN-4217: ------------------------------------- [~eepayne] - is this a duplicate of YARN-2005? > Failed AM attempt retries on same failed host > --------------------------------------------- > > Key: YARN-4217 > URL: https://issues.apache.org/jira/browse/YARN-4217 > Project: Hadoop YARN > Issue Type: Improvement > Components: applications > Affects Versions: 2.7.1 > Reporter: Eric Payne > > This happens when the cluster is maxed out. One node is going bad, so > everything that happens on it fails, so the bad node is never busy. Since the > cluster is maxed out, when the RM looks for a node with available resources, > it will always find the almost bad one because nothing can run on it so it > has available resources. -- This message was sent by Atlassian JIRA (v6.3.4#6332)