Andrew Onischuk created AMBARI-10013:
----------------------------------------

             Summary: Nodemanager fails to start with error 
java.io.IOException: Linux container executor not configured properly (error=24)
                 Key: AMBARI-10013
                 URL: https://issues.apache.org/jira/browse/AMBARI-10013
             Project: Ambari
          Issue Type: Bug
            Reporter: Andrew Onischuk
            Assignee: Andrew Onischuk
             Fix For: 2.0.0


In Ambari deployed clusters, Nodemanager fails to start with the following
exception for the test case where we write container_executor.cfg

**nodemanager log**
    
    
    
    2015-03-05 20:16:19,431 INFO  nodemanager.NodeManager 
(SignalLogger.java:register(91)) - registered UNIX signal handlers for [TERM, 
HUP, INT]
    2015-03-05 20:16:21,363 WARN  nodemanager.LinuxContainerExecutor 
(LinuxContainerExecutor.java:init(184)) - Exit code from container executor 
initialization is : 24
    ExitCodeException exitCode=24: File /etc/hadoop/conf.empty must be owned by 
root, but is owned by 1005
    
            at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
            at org.apache.hadoop.util.Shell.run(Shell.java:455)
            at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:715)
            at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:181)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:209)
            at 
org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:462)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:509)
    2015-03-05 20:16:21,366 INFO  nodemanager.ContainerExecutor 
(ContainerExecutor.java:logOutput(283)) -
    2015-03-05 20:16:21,367 INFO  service.AbstractService 
(AbstractService.java:noteFailure(272)) - Service NodeManager failed in state 
INITED; cause: org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed 
to initialize container executor
    org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to 
initialize container executor
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:211)
            at 
org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:462)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:509)
    Caused by: java.io.IOException: Linux container executor not configured 
properly (error=24)
            at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:187)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:209)
            ... 3 more
    Caused by: ExitCodeException exitCode=24: File /etc/hadoop/conf.empty must 
be owned by root, but is owned by 1005
    
            at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
            at org.apache.hadoop.util.Shell.run(Shell.java:455)
            at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:715)
            at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:181)
            ... 4 more
    2015-03-05 20:16:21,374 WARN  service.AbstractService 
(ServiceOperations.java:stopQuietly(82)) - When stopping the service 
NodeManager : java.lang.NullPointerException
    java.lang.NullPointerException
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.stopRecoveryStore(NodeManager.java:161)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceStop(NodeManager.java:273)
            at 
org.apache.hadoop.service.AbstractService.stop(AbstractService.java:221)
            at 
org.apache.hadoop.service.ServiceOperations.stop(ServiceOperations.java:52)
            at 
org.apache.hadoop.service.ServiceOperations.stopQuietly(ServiceOperations.java:80)
            at 
org.apache.hadoop.service.AbstractService.init(AbstractService.java:171)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:462)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:509)
    2015-03-05 20:16:21,375 FATAL nodemanager.NodeManager 
(NodeManager.java:initAndStartNodeManager(465)) - Error starting NodeManager
    org.apache.hadoop.yarn.exceptions.YarnRuntimeException: Failed to 
initialize container executor
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:211)
            at 
org.apache.hadoop.service.AbstractService.init(AbstractService.java:163)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:462)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:509)
    Caused by: java.io.IOException: Linux container executor not configured 
properly (error=24)
            at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:187)
            at 
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:209)
            ... 3 more
    Caused by: ExitCodeException exitCode=24: File /etc/hadoop/conf.empty must 
be owned by root, but is owned by 1005
    
            at org.apache.hadoop.util.Shell.runCommand(Shell.java:538)
            at org.apache.hadoop.util.Shell.run(Shell.java:455)
            at 
org.apache.hadoop.util.Shell$ShellCommandExecutor.execute(Shell.java:715)
            at 
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:181)
            ... 4 more
    

**container_executor.cfg**
    
    
    
    cat /etc/hadoop/conf/container-executor.cfg
    yarn.nodemanager.local-dirs=/grid/0/hadoop/yarn/local
    yarn.nodemanager.log-dirs=/grid/0/hadoop/yarn/log
    yarn.nodemanager.linux-container-executor.group=hadoop
    banned.users=yarn
    min.user.id=1000
    allowed.system.users=nobody
    

Please note that test code does not touch conf.empty folder

**conf.empty permissions**
    
    
    
    ls -l /etc/hadoop/
    total 4
    lrwxrwxrwx 1 root root     22 Mar  4 21:01 conf -> /etc/hadoop/conf.empty
    drwxr-xr-x 2 hdfs hadoop 4096 Mar  5 19:10 conf.empty
    

Attached are the conf directory, nodemanager logs, RM logs and console logs





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to