[ https://issues.apache.org/jira/browse/YARN-5036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Wangda Tan reassigned YARN-5036:
--------------------------------
Assignee: Wangda Tan
> RM crashed with NPE while handling CONTAINER_EXPIRED event
> ----------------------------------------------------------
>
> Key: YARN-5036
> URL: https://issues.apache.org/jira/browse/YARN-5036
> Project: Hadoop YARN
> Issue Type: Bug
> Components: capacityscheduler, resourcemanager
> Affects Versions: 2.9.0
> Reporter: Karam Singh
> Assignee: Wangda Tan
>
> I was running some TPC-DS queries against a branch-2 build, and after some hours
> the RM crashed with the following exception:
> 2016-04-30 08:40:34,332 [Ping Checker] INFO org.apache.hadoop.yarn.util.AbstractLivelinessMonitor: Expired:<container=container_1461941833306_0345_01_007806, increase=false> Timed out after 600 secs
> 2016-04-30 08:40:34,333 [ResourceManager Event Processor] FATAL org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Error in handling event type CONTAINER_EXPIRED to the scheduler
> java.lang.NullPointerException
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.completedContainer(LeafQueue.java:1327)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.completedContainerInternal(CapacityScheduler.java:1595)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.AbstractYarnScheduler.completedContainer(AbstractYarnScheduler.java:527)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1430)
> at org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:139)
> at org.apache.hadoop.yarn.server.resourcemanager.ResourceManager$SchedulerEventDispatcher$EventProcessor.run(ResourceManager.java:771)
> at java.lang.Thread.run(Thread.java:745)
> 2016-04-30 08:40:34,333 [ResourceManager Event Processor] INFO org.apache.hadoop.yarn.server.resourcemanager.ResourceManager: Exiting, bbye..
> 2016-04-30 08:40:36,632 [Thread[Thread-185,5,main]] ERROR org.apache.hadoop.security.token.delegation.AbstractDelegationTokenSecretManager: ExpiredTokenRemover received java.lang.InterruptedException: sleep interrupted
> /cc [~leftnoteasy]