Seyyed Ahmad Javadi created YARN-8181:
-----------------------------------------

             Summary: Docker container run_time
                 Key: YARN-8181
                 URL: https://issues.apache.org/jira/browse/YARN-8181
             Project: Hadoop YARN
          Issue Type: Bug
            Reporter: Seyyed Ahmad Javadi


Hi All,

I want to use docker container run time but could not solve the facing problem. 
I am following the guide below and the NM log is as follows. I can not see any 
docker containers to be created. It works when I use default LCE. Please also 
find how I submit a job at the end as well.

May you please let me know how can use LCE binary to make sure my docker setup 
is correct?

I confirmed that "docker run" works fine. I really like this developing feature 
and would like to contribute to it. Many thanks in advance.

https://hadoop.apache.org/docs/current/hadoop-yarn/hadoop-yarn-site/DockerContainers.html
{code:java}
NM LOG:
...
2018-04-19 11:49:24,568 INFO SecurityLogger.org.apache.hadoop.ipc.Server: Auth 
successful for appattempt_1524151293356_0005_000001 (auth:SIMPLE)
2018-04-19 11:49:24,580 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl:
 Start request for container_1524151293356_0005_01_000001 by user ubuntu
2018-04-19 11:49:24,584 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.ContainerManagerImpl:
 Creating a new application reference for app application_1524151293356_0005
2018-04-19 11:49:24,584 INFO 
org.apache.hadoop.yarn.server.nodemanager.NMAuditLogger: USER=ubuntu    
IP=130.245.127.176    OPERATION=Start Container Request    
TARGET=ContainerManageImpl    RESULT=SUCCESS    
APPID=application_1524151293356_0005    
CONTAINERID=container_1524151293356_0005_01_000001
2018-04-19 11:49:24,585 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Application application_1524151293356_0005 transitioned from NEW to INITING
2018-04-19 11:49:24,585 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Adding container_1524151293356_0005_01_000001 to application 
application_1524151293356_0005
2018-04-19 11:49:24,585 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Application application_1524151293356_0005 transitioned from INITING to RUNNING
2018-04-19 11:49:24,588 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
 Container container_1524151293356_0005_01_000001 transitioned from NEW to 
LOCALIZING
2018-04-19 11:49:24,588 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices: Got 
event CONTAINER_INIT for appId application_1524151293356_0005
2018-04-19 11:49:24,589 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
 Created localizer for container_1524151293356_0005_01_000001
2018-04-19 11:49:24,616 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.localizer.ResourceLocalizationService:
 Writing credentials to the nmPrivate file 
/tmp/hadoop-ubuntu/nm-local-dir/nmPrivate/container_1524151293356_0005_01_000001.tokens
2018-04-19 11:49:28,090 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
 Container container_1524151293356_0005_01_000001 transitioned from LOCALIZING 
to SCHEDULED
2018-04-19 11:49:28,090 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.scheduler.ContainerScheduler:
 Starting container [container_1524151293356_0005_01_000001]
2018-04-19 11:49:28,212 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
 Container container_1524151293356_0005_01_000001 transitioned from SCHEDULED 
to RUNNING
2018-04-19 11:49:28,212 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
 Starting resource-monitoring for container_1524151293356_0005_01_000001
2018-04-19 11:49:29,401 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
 Container container_1524151293356_0005_01_000001 succeeded
2018-04-19 11:49:29,401 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
 Container container_1524151293356_0005_01_000001 transitioned from RUNNING to 
EXITED_WITH_SUCCESS
2018-04-19 11:49:29,401 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
 Cleaning up container container_1524151293356_0005_01_000001
2018-04-19 11:49:29,520 INFO 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Removing 
Docker container : container_1524151293356_0005_01_000001
2018-04-19 11:49:34,517 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
 Could not get pid for container_1524151293356_0005_01_000001. Waited for 5000 
ms.
2018-04-19 11:49:34,517 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.launcher.ContainerLaunch:
 Unable to obtain pid, but docker container request detected. Attempting to 
reap container container_1524151293356_0005_01_000001
2018-04-19 11:49:36,927 INFO 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Deleting 
absolute path : 
/tmp/hadoop-ubuntu/nm-local-dir/usercache/ubuntu/appcache/application_1524151293356_0005/container_1524151293356_0005_01_000001
2018-04-19 11:49:36,928 INFO 
org.apache.hadoop.yarn.server.nodemanager.NMAuditLogger: USER=ubuntu    
OPERATION=Container Finished - Succeeded    TARGET=ContainerImpl    
RESULT=SUCCESS    APPID=application_1524151293356_0005    
CONTAINERID=container_1524151293356_0005_01_000001
2018-04-19 11:49:36,929 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.container.ContainerImpl:
 Container container_1524151293356_0005_01_000001 transitioned from 
EXITED_WITH_SUCCESS to DONE
2018-04-19 11:49:36,938 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Removing container_1524151293356_0005_01_000001 from application 
application_1524151293356_0005
2018-04-19 11:49:36,938 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.monitor.ContainersMonitorImpl:
 Stopping resource-monitoring for container_1524151293356_0005_01_000001
2018-04-19 11:49:36,938 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices: Got 
event CONTAINER_STOP for appId application_1524151293356_0005
2018-04-19 11:49:37,941 INFO 
org.apache.hadoop.yarn.server.nodemanager.NodeStatusUpdaterImpl: Removed 
completed containers from NM context: [container_1524151293356_0005_01_000001]
2018-04-19 11:49:50,966 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Application application_1524151293356_0005 transitioned from RUNNING to 
APPLICATION_RESOURCES_CLEANINGUP
2018-04-19 11:49:50,967 INFO 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Deleting 
absolute path : 
/tmp/hadoop-ubuntu/nm-local-dir/usercache/ubuntu/appcache/application_1524151293356_0005
2018-04-19 11:49:50,967 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.AuxServices: Got 
event APPLICATION_STOP for appId application_1524151293356_0005
2018-04-19 11:49:50,967 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.application.ApplicationImpl:
 Application application_1524151293356_0005 transitioned from 
APPLICATION_RESOURCES_CLEANINGUP to FINISHED
2018-04-19 11:49:50,967 INFO 
org.apache.hadoop.yarn.server.nodemanager.containermanager.loghandler.NonAggregatingLogHandler:
 Scheduling Log Deletion for application: application_1524151293356_0005, with 
delay of 10800 seconds

{code}
{code:java}
vars="YARN_CONTAINER_RUNTIME_TYPE=docker,YARN_CONTAINER_RUNTIME_DOCKER_IMAGE=hadoop-ubuntu:latest"

#vars="YARN_CONTAINER_RUNTIME_TYPE=docker,YARN_CONTAINER_RUNTIME_DOCKER_IMAGE=hadoop-ubuntu:latest,YARN_CONTAINER_RUNTIME_DOCKER_RUN_OVERRIDE_DISABLE=false,YARN_CONTAINER_RUNTIME_DOCKER_CONTAINER_NETWORK=host"

#vars="YARN_CONTAINER_RUNTIME_TYPE=default"

hadoop jar 
/home/ubuntu/hadoop-3.1.0/share/hadoop/mapreduce/hadoop-mapreduce-examples-3.1.0.jar
 pi -Dyarn.app.mapreduce.am.env=$vars -Dmapreduce.map.env=$vars 
-Dmapreduce.reduce.env=$vars 2 10
{code}
{code:java}
Number of Maps  = 2
Samples per Map = 10
Wrote input for Map #0
Wrote input for Map #1
Starting Job
2018-04-19 11:49:22,786 INFO client.RMProxy: Connecting to ResourceManager at 
bay1-vm1/130.245.127.176:8032
2018-04-19 11:49:23,435 INFO mapreduce.JobResourceUploader: Disabling Erasure 
Coding for path: /tmp/hadoop-yarn/staging/ubuntu/.staging/job_1524151293356_0005
2018-04-19 11:49:23,601 INFO input.FileInputFormat: Total input files to 
process : 2
2018-04-19 11:49:23,756 INFO mapreduce.JobSubmitter: number of splits:2
2018-04-19 11:49:23,824 INFO Configuration.deprecation: 
yarn.resourcemanager.system-metrics-publisher.enabled is deprecated. Instead, 
use yarn.system-metrics-publisher.enabled
2018-04-19 11:49:24,015 INFO mapreduce.JobSubmitter: Submitting tokens for job: 
job_1524151293356_0005
2018-04-19 11:49:24,017 INFO mapreduce.JobSubmitter: Executing with tokens: []
2018-04-19 11:49:24,262 INFO conf.Configuration: resource-types.xml not found
2018-04-19 11:49:24,262 INFO resource.ResourceUtils: Unable to find 
'resource-types.xml'.
2018-04-19 11:49:24,350 INFO impl.YarnClientImpl: Submitted application 
application_1524151293356_0005
2018-04-19 11:49:24,398 INFO mapreduce.Job: The url to track the job: 
http://bay1-vm1:8088/proxy/application_1524151293356_0005/
2018-04-19 11:49:24,399 INFO mapreduce.Job: Running job: job_1524151293356_0005
2018-04-19 11:49:50,658 INFO mapreduce.Job: Job job_1524151293356_0005 running 
in uber mode : false
2018-04-19 11:49:50,660 INFO mapreduce.Job:  map 0% reduce 0%
2018-04-19 11:49:50,676 INFO mapreduce.Job: Job job_1524151293356_0005 failed 
with state FAILED due to: Application application_1524151293356_0005 failed 2 
times due to AM Container for appattempt_1524151293356_0005_000002 exited with  
exitCode: 0
Failing this attempt.Diagnostics: For more detailed output, check the 
application tracking page: 
http://bay1-vm1:8088/cluster/app/application_1524151293356_0005 Then click on 
links to logs of each attempt.
. Failing the application.
2018-04-19 11:49:50,702 INFO mapreduce.Job: Counters: 0
Job job_1524151293356_0005 failed!
runtime in seconds: 34
{code}
 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: yarn-dev-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-dev-h...@hadoop.apache.org

Reply via email to