[ https://issues.apache.org/jira/browse/YARN-3675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14556201#comment-14556201 ]
Hudson commented on YARN-3675:
------------------------------

FAILURE: Integrated in Hadoop-Hdfs-trunk #2133 (See [https://builds.apache.org/job/Hadoop-Hdfs-trunk/2133/])
YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha) (kasha: rev 4513761869c732cf2f462763043067ebf8749df7)
* hadoop-yarn-project/CHANGES.txt
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/FairScheduler.java
* hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/TestFairScheduler.java

> FairScheduler: RM quits when node removal races with continuous-scheduling on the same node
> -------------------------------------------------------------------------------------------
>
>                 Key: YARN-3675
>                 URL: https://issues.apache.org/jira/browse/YARN-3675
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: fairscheduler
>            Reporter: Anubhav Dhoot
>            Assignee: Anubhav Dhoot
>            Priority: Critical
>             Fix For: 2.7.1
>
>         Attachments: YARN-3675.001.patch, YARN-3675.002.patch, YARN-3675.003.patch
>
>
> With continuous scheduling, scheduling can be attempted on a node that has just been removed, causing errors like the one below.
> {noformat}
> 12:28:53.782 AM FATAL org.apache.hadoop.yarn.server.resourcemanager.ResourceManager
> Error in handling event type APP_ATTEMPT_REMOVED to the scheduler
> java.lang.NullPointerException
>     at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FSAppAttempt.unreserve(FSAppAttempt.java:469)
>     at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.completedContainer(FairScheduler.java:815)
>     at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.removeApplicationAttempt(FairScheduler.java:763)
>     at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.handle(FairScheduler.java:1217)
>     at org.apache.hadoop.yarn.server.resourcemanager.scheduler.fair.FairScheduler.handle(FairScheduler.java:111)
>     at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:684)
>     at java.lang.Thread.run(Thread.java:745)
> 12:28:53.783 AM INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceManager
> Exiting, bbye..
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
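
For readers unfamiliar with this kind of race, here is a minimal, self-contained sketch of the general pattern: a scheduling pass takes a snapshot of the nodes, a node is removed before the pass reaches it, and the pass re-checks membership under the lock before touching the node. It is not taken from the attached patches. The class `NodeRemovalRaceSketch`, its nested `Node` type, and all method names are hypothetical stand-ins, not the real `FairScheduler` API.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only; these are not the real YARN classes.
final class NodeRemovalRaceSketch {

    /** Minimal stand-in for a scheduler node: an id and free capacity. */
    static final class Node {
        final String id;
        int freeSlots;

        Node(String id, int freeSlots) {
            this.id = id;
            this.freeSlots = freeSlots;
        }
    }

    // Guarded by the monitor of this object.
    private final Map<String, Node> nodes = new HashMap<>();

    synchronized void addNode(String id, int freeSlots) {
        nodes.put(id, new Node(id, freeSlots));
    }

    synchronized void removeNode(String id) {
        nodes.remove(id);
    }

    /** What a background scheduling pass would iterate over. */
    synchronized List<Node> snapshotNodes() {
        return new ArrayList<>(nodes.values());
    }

    /**
     * Tries to place one container on the candidate node.
     * Returns false without touching the node if it is no longer tracked.
     */
    synchronized boolean attemptScheduling(Node candidate) {
        // The node may have been removed between the snapshot and this call,
        // so re-check membership while holding the lock.
        if (nodes.get(candidate.id) != candidate) {
            return false;
        }
        if (candidate.freeSlots == 0) {
            return false;
        }
        candidate.freeSlots--;
        return true;
    }
}
```

In this sketch, a call to `removeNode` that lands between `snapshotNodes` and `attemptScheduling` makes the attempt a no-op instead of leaving state behind on a node the scheduler no longer tracks.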