Zhijie Shen created YARN-367:
--------------------------------
Summary: Exception when yarn.nodemanager.local-dirs is not
explicitly set
Key: YARN-367
URL: https://issues.apache.org/jira/browse/YARN-367
Project: Hadoop YARN
Issue Type: Bug
Components: nodemanager
Reporter: Zhijie Shen
If yarn.nodemanager.local-dirs is not explicitly set, and if the default
local-dirs are not the children of hadoop.tmp.dir, the exception will occur
when the wordcount example is run. Bellow is log info.
==========
2013-01-30 22:16:04,229 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl:
Start request for container_1359612879014_0001_01_000001 by user zshen
2013-01-30 22:16:04,247 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl:
Creating a new application reference for app application_1359612879014_0001
2013-01-30 22:16:04,250 INFO
org.apache.hadoop.yarn.server.nodemanager.NMAuditLogger: USER=zshen
IP=127.0.0.1 OPERATION=Start Container Request
TARGET=ContainerManageImpl RESULT=SUCCESS
APPID=application_1359612879014_0001
CONTAINERID=container_1359612879014_0001_01_000001
2013-01-30 22:16:04,252 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Application application_1359612879014_0001 transitioned from NEW to INITING
2013-01-30 22:16:04,252 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Adding container_1359612879014_0001_01_000001 to application
application_1359612879014_0001
2013-01-30 22:16:04,257 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Application application_1359612879014_0001 transitioned from INITING to RUNNING
2013-01-30 22:16:04,262 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_1359612879014_0001_01_000001 transitioned from NEW to
LOCALIZING
2013-01-30 22:16:04,268 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/appTokens
transitioned from INIT to DOWNLOADING
2013-01-30 22:16:04,268 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.jar
transitioned from INIT to DOWNLOADING
2013-01-30 22:16:04,268 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.splitmetainfo
transitioned from INIT to DOWNLOADING
2013-01-30 22:16:04,268 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.split
transitioned from INIT to DOWNLOADING
2013-01-30 22:16:04,269 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.xml
transitioned from INIT to DOWNLOADING
2013-01-30 22:16:04,269 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Created localizer for container_1359612879014_0001_01_000001
2013-01-30 22:16:04,401 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
Writing credentials to the nmPrivate file
/tmp/hadoop-zshen/nm-local-dir/nmPrivate/container_1359612879014_0001_01_000001.tokens.
Credentials list:
2013-01-30 22:16:04,423 INFO
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor:
Initializing user zshen
2013-01-30 22:16:04,569 INFO
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor: Copying
from
/tmp/hadoop-zshen/nm-local-dir/nmPrivate/container_1359612879014_0001_01_000001.tokens
to
/tmp/hadoop-zshen/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001/container_1359612879014_0001_01_000001.tokens
2013-01-30 22:16:04,570 INFO
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor: CWD set to
/tmp/hadoop-zshen/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001
=
file:/tmp/hadoop-zshen/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001
2013-01-30 22:16:04,955 INFO
org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl: Sending out
status for container: container_id {, app_attempt_id {, application_id {, id:
1, cluster_timestamp: 1359612879014, }, attemptId: 1, }, id: 1, }, state:
C_RUNNING, diagnostics: "", exit_status: -1000,
2013-01-30 22:16:05,117 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/appTokens
transitioned from DOWNLOADING to LOCALIZED
2013-01-30 22:16:05,312 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.jar
transitioned from DOWNLOADING to LOCALIZED
2013-01-30 22:16:05,465 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.splitmetainfo
transitioned from DOWNLOADING to LOCALIZED
2013-01-30 22:16:05,608 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.split
transitioned from DOWNLOADING to LOCALIZED
2013-01-30 22:16:05,751 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.LocalizedResource:
Resource
hdfs://localhost:9001/tmp/hadoop-yarn/staging/zshen/.staging/job_1359612879014_0001/job.xml
transitioned from DOWNLOADING to LOCALIZED
2013-01-30 22:16:05,752 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_1359612879014_0001_01_000001 transitioned from LOCALIZING
to LOCALIZED
2013-01-30 22:16:05,866 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_1359612879014_0001_01_000001 transitioned from LOCALIZED to
RUNNING
2013-01-30 22:16:05,866 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
ResourceCalculatorPlugin is unavailable on this system.
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl
is disabled.
2013-01-30 22:16:05,910 WARN
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
Failed to launch container.
java.io.FileNotFoundException: File
/Users/zshen/Deployment/hadoop-3.0.0-SNAPSHOT/data/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001/container_1359612879014_0001_01_000001
does not exist
at
org.apache.hadoop.fs.RawLocalFileSystem.getFileStatus(RawLocalFileSystem.java:498)
at org.apache.hadoop.fs.FileSystem.primitiveMkdir(FileSystem.java:996)
at
org.apache.hadoop.fs.DelegateToFileSystem.mkdir(DelegateToFileSystem.java:150)
at org.apache.hadoop.fs.FilterFs.mkdir(FilterFs.java:187)
at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:730)
at org.apache.hadoop.fs.FileContext$4.next(FileContext.java:726)
at
org.apache.hadoop.fs.FileContext$FSLinkResolver.resolve(FileContext.java:2379)
at org.apache.hadoop.fs.FileContext.mkdir(FileContext.java:726)
at
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.createDir(DefaultContainerExecutor.java:330)
at
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor.launchContainer(DefaultContainerExecutor.java:135)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:242)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch.call(ContainerLaunch.java:68)
at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
at java.lang.Thread.run(Thread.java:680)
2013-01-30 22:16:05,913 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_1359612879014_0001_01_000001 transitioned from RUNNING to
EXITED_WITH_FAILURE
2013-01-30 22:16:05,914 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
Cleaning up container container_1359612879014_0001_01_000001
2013-01-30 22:16:05,934 INFO
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor: Deleting
absolute path :
/tmp/hadoop-zshen/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001/container_1359612879014_0001_01_000001
2013-01-30 22:16:05,934 WARN
org.apache.hadoop.yarn.server.nodemanager.NMAuditLogger: USER=zshen
OPERATION=Container Finished - Failed TARGET=ContainerImpl RESULT=FAILURE
DESCRIPTION=Container failed with state: EXITED_WITH_FAILURE
APPID=application_1359612879014_0001
CONTAINERID=container_1359612879014_0001_01_000001
2013-01-30 22:16:05,937 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.Container:
Container container_1359612879014_0001_01_000001 transitioned from
EXITED_WITH_FAILURE to DONE
2013-01-30 22:16:05,937 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Removing container_1359612879014_0001_01_000001 from application
application_1359612879014_0001
2013-01-30 22:16:05,937 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
ResourceCalculatorPlugin is unavailable on this system.
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl
is disabled.
2013-01-30 22:16:05,958 INFO
org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl: Sending out
status for container: container_id {, app_attempt_id {, application_id {, id:
1, cluster_timestamp: 1359612879014, }, attemptId: 1, }, id: 1, }, state:
C_COMPLETE, diagnostics: "", exit_status: -1,
2013-01-30 22:16:05,959 INFO
org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl: Removed
completed container container_1359612879014_0001_01_000001
2013-01-30 22:16:06,965 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Application application_1359612879014_0001 transitioned from RUNNING to
APPLICATION_RESOURCES_CLEANINGUP
2013-01-30 22:16:06,965 INFO
org.apache.hadoop.yarn.server.nodemanager.DefaultContainerExecutor: Deleting
absolute path :
/tmp/hadoop-zshen/nm-local-dir/usercache/zshen/appcache/application_1359612879014_0001
2013-01-30 22:16:06,966 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices: Got
event APPLICATION_STOP for appId application_1359612879014_0001
2013-01-30 22:16:06,970 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.Application:
Application application_1359612879014_0001 transitioned from
APPLICATION_RESOURCES_CLEANINGUP to FINISHED
2013-01-30 22:16:06,970 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.loghandler.NonAggregatingLogHandler:
Scheduling Log Deletion for application: application_1359612879014_0001, with
delay of 10800 seconds
==========
Below is the setting in hdfs-site.xml.
==========
<property>
<name>hadoop.tmp.dir</name>
<value>/Users/zshen/Deployment/hadoop-3.0.0-SNAPSHOT/data</value>
</property>
==========
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira