[
https://issues.apache.org/jira/browse/YARN-10934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17493631#comment-17493631
]
Yuan Luo commented on YARN-10934:
---------------------------------
After applying this patch to our cluster, the problem was fixed. Thank you very
much! [~bteke]
> LeafQueue activateApplications NPE
> ----------------------------------
>
> Key: YARN-10934
> URL: https://issues.apache.org/jira/browse/YARN-10934
> Project: Hadoop YARN
> Issue Type: Bug
> Components: RM
> Affects Versions: 3.3.1
> Reporter: Yuan Luo
> Assignee: Benjamin Teke
> Priority: Major
> Labels: pull-request-available
> Fix For: 3.4.0
>
> Attachments: RM-capacity-scheduler.xml, RM-yarn-site.xml
>
> Time Spent: 1h 40m
> Remaining Estimate: 0h
>
> Our prod Yarn cluster is hadoop version 3.3.1 , we changed
> DefaultResourceCalculator -> DominantResourceCalculator and restart RM, then
> our RM crashed, the Exception stack like below. I think this is a serious
> bug and hope someone can follow up and fix it.
> {code:java}
> 2021-08-30 21:00:59,114 ERROR event.EventDispatcher
> (MarkerIgnoringBase.java:error(159)) - Error in handling event type
> APP_ATTEMPT_REMOVED to the Event Dispatcher
> java.lang.NullPointerException
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.activateApplications(LeafQueue.java:868)
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.removeApplicationAttempt(LeafQueue.java:1014)
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.LeafQueue.finishApplicationAttempt(LeafQueue.java:972)
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.doneApplicationAttempt(CapacityScheduler.java:1188)
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:1904)
> at
> org.apache.hadoop.yarn.server.resourcemanager.scheduler.capacity.CapacityScheduler.handle(CapacityScheduler.java:171)
> at
> org.apache.hadoop.yarn.event.EventDispatcher$EventProcessor.run(EventDispatcher.java:79)
> at java.base/java.lang.Thread.run(Thread.java:834)
> {code}
--
This message was sent by Atlassian Jira
(v8.20.1#820001)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]