[ https://issues.apache.org/jira/browse/YARN-2822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14200977#comment-14200977 ]
Jian He commented on YARN-2822: ------------------------------- The problem is on recovery, if the previous attempt already finished, we are not adding it the scheduler. when scheduler tries to transferStateFromPreviousAttempt for work-presrving AM restart, it throws NPE. > NPE when RM tries to transfer state from previous attempt on recovery > --------------------------------------------------------------------- > > Key: YARN-2822 > URL: https://issues.apache.org/jira/browse/YARN-2822 > Project: Hadoop YARN > Issue Type: Sub-task > Components: resourcemanager > Reporter: Jian He > Assignee: Jian He > > {code} > 2014-09-16 01:36:28,037 FATAL resourcemanager.ResourceManager > (ResourceManager.java:run(612)) - Error in handling event type > APP_ATTEMPT_ADDED to the scheduler > java.lang.NullPointerException > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.SchedulerApplicationAttempt.transferStateFromPreviousAttempt(SchedulerApplicationAttempt.java:530) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.addApplicationAttempt(CapacityScheduler.java:678) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1015) > at > org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:98) > at > org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:603) > at java.lang.Thread.run(Thread.java:744) > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)