[
https://issues.apache.org/jira/browse/YARN-1801?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14103649#comment-14103649
]
Beckham007 commented on YARN-1801:
----------------------------------
When something goes wrong with HDFS, this error occurs.
The NPE crashes the NM, so I think we should fix this in YARN.
2014-08-20 10:21:04,004 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Failed to download rsrc { {
hdfs://...:54310/tmp/temp-793434835/tmp-707424512/CosAgent.jar, 1408501159584,
FILE, null
},pending,[(container_1407229860715_13071531_01_000087)],18021755091999344,DOWNLOADING}
java.io.FileNotFoundException: File does not exist:
hdfs://...:54310/tmp/temp-793434835/tmp-707424512/CosAgent.jar
2014-08-20 10:21:04,032 FATAL
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Error: Shutting down
java.lang.NullPointerException
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$PublicLocalizer.run(ResourceLocalizationService.java:712)
2014-08-20 10:21:04,032 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Public cache exiting
2014-08-20 10:21:04,052 FATAL org.apache.hadoop.yarn.event.AsyncDispatcher:
Error in dispatcher thread
java.util.concurrent.RejectedExecutionException
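
To illustrate the kind of fix I have in mind, here is a minimal sketch of a worker loop that survives a failed download. It is not the actual ResourceLocalizationService code: the class PublicLocalizerSketch, the method drain, and the pending map are hypothetical names, and it assumes the NPE comes from dereferencing a missing bookkeeping entry while handling the failure, which the log alone does not confirm.

```java
import java.io.FileNotFoundException;
import java.util.Map;
import java.util.concurrent.CompletionService;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorCompletionService;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

// Hypothetical stand-in for a public localizer loop; all names are illustrative.
public class PublicLocalizerSketch {

    // Drains n completed downloads and returns how many failed.
    // A failed or untracked download is logged and skipped instead of
    // letting an exception escape and stop the worker.
    static int drain(CompletionService<String> queue,
                     Map<Future<String>, String> pending,
                     int n) throws InterruptedException {
        int failures = 0;
        for (int i = 0; i < n; i++) {
            Future<String> completed = queue.take();
            // May be null if the bookkeeping entry is missing.
            String resource = pending.remove(completed);
            try {
                String localPath = completed.get();
                if (resource == null) {
                    System.err.println("Localized unknown resource to " + localPath);
                    continue;
                }
                System.out.println(resource + " -> " + localPath);
            } catch (ExecutionException e) {
                failures++;
                // Guard the lookup result before using it in the error path.
                String name = (resource == null) ? "<unknown resource>" : resource;
                System.err.println("Failed to download " + name + ": " + e.getCause());
            }
        }
        return failures;
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(2);
        CompletionService<String> queue = new ExecutorCompletionService<>(pool);
        Map<Future<String>, String> pending = new ConcurrentHashMap<>();

        // A tracked download that succeeds.
        pending.put(queue.submit(() -> "/local/ok.jar"), "hdfs://example/ok.jar");
        // An untracked download that fails, as when the file is gone from HDFS.
        queue.submit(() -> { throw new FileNotFoundException("File does not exist"); });

        int failures = drain(queue, pending, 2);
        pool.shutdown();
        System.out.println("failures=" + failures);
    }
}
```

The point of the sketch is only that the error path checks the lookup result before using it, so a single missing file cannot take down the whole loop.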
> NPE in public localizer
> -----------------------
>
> Key: YARN-1801
> URL: https://issues.apache.org/jira/browse/YARN-1801
> Project: Hadoop YARN
> Issue Type: Sub-task
> Components: nodemanager
> Reporter: Jason Lowe
> Assignee: Hong Zhiguo
> Priority: Critical
> Attachments: YARN-1801.patch
>
>
> While investigating YARN-1800, I found this in the NM logs; it caused the
> public localizer to shut down:
> {noformat}
> 2014-01-23 01:26:38,655 INFO localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:addResource(651)) - Downloading public
> rsrc:{
> hdfs://colo-2:8020/user/fertrist/oozie-oozi/0000601-140114233013619-oozie-oozi-W/aggregator--map-reduce/map-reduce-launcher.jar,
> 1390440382009, FILE, null }
> 2014-01-23 01:26:38,656 FATAL localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:run(726)) - Error: Shutting down
> java.lang.NullPointerException
> at
> org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService$PublicLocalizer.run(ResourceLocalizationService.java:712)
> 2014-01-23 01:26:38,656 INFO localizer.ResourceLocalizationService
> (ResourceLocalizationService.java:run(728)) - Public cache exiting
> {noformat}