[
https://issues.apache.org/jira/browse/YARN-10048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sen Zhao updated YARN-10048:
Description:
After manually mounting the Cgroup, the NodeManager fails to start.
If the cpu controller has multiple mount path, only the first mount path will
be returned. This will cause the return value to be not the actual cpu
controller mount path.
{code:java}
2019-12-19 14:46:08,200 INFO
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
Mounting controller cpu at /opt/cgroup/cpu
2019-12-19 14:46:08,290 WARN
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor:
Shell execution returned exit code: 32. Privileged Execution Op
eration Stderr:
Feature disabled: mount cgroup
Stdout:
Full command array for failed execution:
[/opt/hadoop-yarn/bin/container-executor, --mount-cgroups,
yarn-NodeManager/hadoop-yarn, cpu,cpuacct=/opt/cgroup/cpu]
2019-12-19 14:46:08,290 ERROR
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
Failed to mount controller: cpu
2019-12-19 14:46:08,291 ERROR
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Failed to
bootstrap configured resource subsystems!
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.ResourceHandlerException:
Failed to mount controller: cpu
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl.mountCGroupController(CGroupsHandlerImpl.java:318)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl.initializeCGroupController(CGroupsHandlerImpl.java:365)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsCpuResourceHandlerImpl.bootstrap(CGroupsCpuResourceHandlerImpl.java:98)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsCpuResourceHandlerImpl.bootstrap(CGroupsCpuResourceHandlerImpl.java:87)
at
org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.ResourceHandlerChain.bootstrap(ResourceHandlerChain.java:58)
at
org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor.init(LinuxContainerExecutor.java:325)
at
org.apache.hadoop.yarn.server.nodemanager.NodeManager.serviceInit(NodeManager.java:403)
at
org.apache.hadoop.service.AbstractService.init(AbstractService.java:164)
at
org.apache.hadoop.yarn.server.nodemanager.NodeManager.initAndStartNodeManager(NodeManager.java:962)
at
org.apache.hadoop.yarn.server.nodemanager.NodeManager.main(NodeManager.java:1042)
{code}
was:
After manually mounting the Cgroup, the NodeManager fails to start.
If the cpu controller has multiple mount path, only the first mount path will
be returned. This will cause the return value to be not the actual cpu
controller mount path.
> NodeManager fails to start after mounting CGroup
>
>
> Key: YARN-10048
> URL: https://issues.apache.org/jira/browse/YARN-10048
> Project: Hadoop YARN
> Issue Type: Bug
>Affects Versions: 3.2.1
>Reporter: Sen Zhao
>Assignee: Sen Zhao
>Priority: Major
> Attachments: YARN-10048.001.patch, YARN-10048.002.patch
>
>
> After manually mounting the Cgroup, the NodeManager fails to start.
> If the cpu controller has multiple mount path, only the first mount path will
> be returned. This will cause the return value to be not the actual cpu
> controller mount path.
> {code:java}
> 2019-12-19 14:46:08,200 INFO
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
> Mounting controller cpu at /opt/cgroup/cpu
> 2019-12-19 14:46:08,290 WARN
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.privileged.PrivilegedOperationExecutor:
> Shell execution returned exit code: 32. Privileged Execution Op
> eration Stderr:
> Feature disabled: mount cgroup
> Stdout:
> Full command array for failed execution:
> [/opt/hadoop-yarn/bin/container-executor, --mount-cgroups,
> yarn-NodeManager/hadoop-yarn, cpu,cpuacct=/opt/cgroup/cpu]
> 2019-12-19 14:46:08,290 ERROR
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl:
> Failed to mount controller: cpu
> 2019-12-19 14:46:08,291 ERROR
> org.apache.hadoop.yarn.server.nodemanager.LinuxContainerExecutor: Failed to
> bootstrap configured resource subsystems!
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.ResourceHandlerException:
> Failed to mount controller: cpu
> at
> org.apache.hadoop.yarn.server.nodemanager.containermanager.linux.resources.CGroupsHandlerImpl.mountCGroupController(CGroupsHandlerIm