[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336624#comment-15336624 ] Justin Venus commented on MESOS-4248: - Yes, that is exactly what I want. Thank you for point out the ticket so I didn't have to go Jira spelunking today. > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 1:name=systemd:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336464#comment-15336464 ] Joseph Wu commented on MESOS-4248: -- This might be related to what you want: [MESOS-5544]. > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 1:name=systemd:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15336432#comment-15336432 ] Justin Venus commented on MESOS-4248: - Thanks for pointing that ticket out. However, MESOS-4675 doesn't solve my use case. - I want to run systemd in a docker container - I want mesos-slave to setup the slice "mesos_executors.slice" - I want to use the cgroup isolators - I want mesos-executor tasks to survive a mesos-slave restart - Basically I want mesos-slave to work like it's on bare metal (especially in a docker container I'm carrying around patches for 0.25.0, 0.26.0 and testing 0.27.2 to make this work. I'll open a feature request in jira. Please notice systemd is in a CGroup {code} [root@mesos-slave05of2 /]# systemctl status ● mesos-slave05of2 State: running Jobs: 0 queued Failed: 0 units Since: Wed 2016-06-08 21:41:38 UTC; 1 weeks 1 days ago CGroup: /system.slice/docker-6c53ffcbc602cc6b19149030f6f453a1febd7fc79bf472fa6227c1fecd7c053c.scope ├─1 /usr/lib/systemd/systemd --system --log-target=console --log-level=info --unit=mesos-slave.target ├─mesos_executors.slice │ ├─10139 python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-001235-380151 │ ├─10171 /usr/bin/python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-0012 │ ├─10282 /usr/bin/python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-0012 │ ├─10283 /bin/bash -c echo '#!/bin/bash PEX_INSTALL=${PEX_INSTALL:-${HOME}/.pex/install} LD_LIBRARY_PATH=${LD_LIB │ ├─12647 python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-001235-380151 │ ├─12672 /usr/bin/python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-0012 │ ├─12690 /usr/bin/python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-0012 │ └─12691 python2.7 /var/lib/mesos/slaves/f54196a4-d706-4324-97f6-009e18022152-S8/frameworks/20160221-001235-380151 └─system.slice ├─thermos-observer.service │ └─142 python2.7 /usr/sbin/thermos_observer --mesos-root=/var/lib/mesos --port=1338 --log_to_disk=NONE --log_to_ ├─mesos-slave.service │ ├─ 143 mesos-slave │ ├─ 187 mesos-docker-executor {code} > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15335211#comment-15335211 ] haosdent commented on MESOS-4248: - cc [~jvenus] > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 1:name=systemd:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15335119#comment-15335119 ] Justin Venus commented on MESOS-4248: - I can reproduce this issue on CoreOS and Centos7 hosts. The container image I use is centos7 with systemd to supervising the mesos-slave process. The versions of I've observed with this issue are mesos 0.25.0 and 0.26.0. The patch above works for 0.26.0. I'm about to upgrade to 0.27.2 and will report back. Also: I have run a mesos-slave container on ubuntu 12.04 with this patch and have observed no negative side effects. Here is part of my build setup that could be used reproduce the issue, you'll need to remove some stuff from the Dockerfile. https://gist.github.com/anonymous/66fc10436c2c381c2aea6dda122ce205 > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope >
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15297391#comment-15297391 ] Shane da Silva commented on MESOS-4248: --- FWIW, we can't reproduce this issue on Mesos 0.26.0, but we do hit it on 0.27.2 and 0.28.1. It would be great for someone to review Yubao's patch and consider merging it, as it's convenient to be able to run Mesos in a container for integration testing. For example, in the Chef ecosystem using test-kitchen with kitchen-docker to quickly spin up pseudo-"VMs" is common practice. [~liuyb]: did you by chance ever find a workaround for this issue? > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope >
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15072671#comment-15072671 ] Yubao Liu commented on MESOS-4248: -- Got a patch to fix this issue: https://github.com/Dieken/mesos/commit/0eac426fcca6b2a7ff3b74bc2dc319cbbe4edb20 Please have a look first, I'm fine to register a ReviewBoard account to go through the formal review process, or you can just do your version, never mind :-) > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 1:name=systemd:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (MESOS-4248) mesos slave can't start in CentOS-7 docker container
[ https://issues.apache.org/jira/browse/MESOS-4248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15070890#comment-15070890 ] Yubao Liu commented on MESOS-4248: -- The command to start container: {code} docker run --cap-add SYS_ADMIN -e container=docker -dt centos:7 bash -c 'mount | grep /sys/fs/cgroup/ | awk "{print \$3}" | xargs -n 1 umount; exec /usr/sbin/init' {code} > mesos slave can't start in CentOS-7 docker container > > > Key: MESOS-4248 > URL: https://issues.apache.org/jira/browse/MESOS-4248 > Project: Mesos > Issue Type: Bug > Components: slave >Affects Versions: 0.26.0 > Environment: My host OS is Debian Jessie, the container OS is CentOS > 7.2. > {code} > # cat /etc/system-release > CentOS Linux release 7.2.1511 (Core) > # rpm -qa |grep mesos > mesosphere-zookeeper-3.4.6-0.1.20141204175332.centos7.x86_64 > mesosphere-el-repo-7-1.noarch > mesos-0.26.0-0.2.145.centos701406.x86_64 > $ docker version > Client: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > Server: > Version: 1.9.1 > API version: 1.21 > Go version: go1.4.2 > Git commit: a34a1d5 > Built:Fri Nov 20 12:59:02 UTC 2015 > OS/Arch: linux/amd64 > {code} >Reporter: Yubao Liu > > // Check the "Environment" label above for kinds of software versions. > "systemctl start mesos-slave" can't start mesos-slave: > {code} > # journalctl -u mesos-slave > > Dec 24 10:35:25 mesos-slave1 systemd[1]: Started Mesos Slave. > Dec 24 10:35:25 mesos-slave1 systemd[1]: Starting Mesos Slave... > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210180 12838 > logging.cpp:172] INFO level logging started! > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210603 12838 > main.cpp:190] Build: 2015-12-16 23:06:16 by root > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210625 12838 > main.cpp:192] Version: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210634 12838 > main.cpp:195] Git tag: 0.26.0 > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210644 12838 > main.cpp:199] Git SHA: d3717e5c4d1bf4fca5c41cd7ea54fae489028faa > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.210765 12838 > containerizer.cpp:142] Using isolation: posix/cpu,posix/mem,filesystem/posix > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.215638 12838 > linux_launcher.cpp:103] Using /sys/fs/cgroup/freezer as the freezer hierarchy > for the Linux launcher > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.220279 12838 > systemd.cpp:128] systemd version `219` detected > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: I1224 10:35:25.227017 12838 > systemd.cpp:210] Started systemd slice `mesos_executors.slice` > Dec 24 10:35:25 mesos-slave1 mesos-slave[12845]: Failed to create a > containerizer: Could not create MesosContainerizer: Failed to create > launcher: Failed to locate systemd cgroups hierarchy: does not exist > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service: main process > exited, code=exited, status=1/FAILURE > Dec 24 10:35:25 mesos-slave1 systemd[1]: Unit mesos-slave.service entered > failed state. > Dec 24 10:35:25 mesos-slave1 systemd[1]: mesos-slave.service failed. > {code} > I used strace to debug it, mesos-slave tried to access > "/sys/fs/cgroup/systemd/mesos_executors.slice", but it's actually at > "/sys/fs/cgroup/systemd/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope/mesos_executors.slice/", >mesos-slave should check "/proc/self/cgroup" to find those intermediate > directories: > {code} > # cat /proc/self/cgroup > 8:perf_event:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 7:blkio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 6:net_cls,net_prio:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 5:freezer:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 4:devices:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 3:cpu,cpuacct:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 2:cpuset:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > 1:name=systemd:/system.slice/docker-45875efce9019375cd0c5b29bb1a12275fb6033293f9bf3d97d774a1e5d4de52.scope > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)