[ https://issues.apache.org/jira/browse/YARN-1801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14103649#comment-14103649 ]
Beckham007 commented on YARN-1801:
----------------------------------

This error occurs when something goes wrong with HDFS. The NPE crashes the NM, so I think we should fix this in YARN.

2014-08-20 10:21:04,004 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: Failed to download rsrc { { hdfs://...:54310/tmp/temp-793434835/tmp-707424512/CosAgent.jar, 1408501159584, FILE, null },pending,[(container_1407229860715_13071531_01_000087)],18021755091999344,DOWNLOADING}
java.io.FileNotFoundException: File does not exist: hdfs://...:54310/tmp/temp-793434835/tmp-707424512/CosAgent.jar
2014-08-20 10:21:04,032 FATAL org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: Error: Shutting down
java.lang.NullPointerException
        at org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$PublicLocalizer.run(ResourceLocalizationService.java:712)
2014-08-20 10:21:04,032 INFO org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService: Public cache exiting
2014-08-20 10:21:04,052 FATAL org.apache.hadoop.yarn.event.AsyncDispatcher: Error in dispatcher thread
java.util.concurrent.RejectedExecutionException

> NPE in public localizer
> -----------------------
>
>                 Key: YARN-1801
>                 URL: https://issues.apache.org/jira/browse/YARN-1801
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: nodemanager
>            Reporter: Jason Lowe
>            Assignee: Hong Zhiguo
>            Priority: Critical
>         Attachments: YARN-1801.patch
>
>
> While investigating YARN-1800 found this in the NM logs that caused the
> public localizer to shutdown:
> {noformat}
> 2014-01-23 01:26:38,655 INFO  localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:addResource(651)) - Downloading public
> rsrc:{
> hdfs://colo-2:8020/user/fertrist/oozie-oozi/0000601-140114233013619-oozie-oozi-W/aggregator--map-reduce/map-reduce-launcher.jar,
> 1390440382009, FILE, null }
> 2014-01-23 01:26:38,656 FATAL localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:run(726)) - Error: Shutting down
> java.lang.NullPointerException
>         at
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$PublicLocalizer.run(ResourceLocalizationService.java:712)
> 2014-01-23 01:26:38,656 INFO  localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:run(728)) - Public cache exiting
> {noformat}

--
This message was sent by Atlassian JIRA
(v6.2#6252)