Thank Vinod,

Yeah, I wasn't at all clear on how to set up cgroups for Mesos.

I'll try shutting down cgconfig, cleaning out /cgroup, and seeing what
happens when I start the slave.

If that doesn't work, I'll grab 0.180-rc5 and give that a spin.

Tony


On Fri, Mar 28, 2014 at 1:35 PM, Vinod Kone <[email protected]> wrote:

> Hey Tony,
>
> I don't think Mesos would work nicely if you have multiple hierarchies
> (one per controller) pre 0.18.0. One solution is to un mount and remove all
> the cgroup hierarchies and let Mesos take control of it.
>
> Alternatively, since 0.18.0, we did a major refactor around cgroups
> isolation and it works with multiple hierarchies. So you might want to give
> it (0.18.0-rc5) a try.
>
>
> @vinodkone
>
>
> On Fri, Mar 28, 2014 at 11:05 AM, Tony Landells <[email protected]> wrote:
>
>> Hi,
>>
>> I'm trying to get Mesos up as a PoC at work to see if it's something we
>> could use effectively.
>>
>> I'm running SL 6.4, installed libcgroup and ran cgconfig which creates
>> the cgroup mounts under /cgroup:
>>
>> [[email protected] ~]$ cat /proc/mounts | grep cgroup
>>> cgroup /cgroup/cpuset cgroup rw,relatime,cpuset 0 0
>>> cgroup /cgroup/cpu cgroup rw,relatime,cpu 0 0
>>> cgroup /cgroup/cpuacct cgroup rw,relatime,cpuacct 0 0
>>> cgroup /cgroup/memory cgroup rw,relatime,memory 0 0
>>> cgroup /cgroup/devices cgroup rw,relatime,devices 0 0
>>> cgroup /cgroup/freezer cgroup rw,relatime,freezer 0 0
>>> cgroup /cgroup/net_cls cgroup rw,relatime,net_cls 0 0
>>> cgroup /cgroup/blkio cgroup rw,relatime,blkio 0 0
>>
>>
>> I'm using the mesos_0.17.0_x86_64.rpm package downloaded from
>> mesosphere.io<https://urldefense.proofpoint.com/v1/url?u=http://mesosphere.io&k=ppTtPKfaP99D%2F%2BB0F9qgBA%3D%3D%0A&r=x28Jl7CpmSGov0XieaueXQ%3D%3D%0A&m=4tTsrmnaFBZhAevZGQMu7MERiC%2BYL3QdI7iTMUIFqik%3D%0A&s=1534ed849c30c9cc1f489ea1b0dab2858f004a45101fc4b70503337848dd597b>and
>>  I can't start a slave fine if I use process isolation, but if I try
>> cgroups I get the following in /var/log/messages:
>>
>> 2014-03-28T10:46:05.194958-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.194540  6242 main.cpp:118] Build: 2014-02-27 15:07:00 by root
>>> 2014-03-28T10:46:05.194976-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.194810  6242 main.cpp:120] Creating "cgroups" isolator
>>> 2014-03-28T10:46:05.195267-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.195050  6242 main.cpp:132] Starting Mesos slave
>>> 2014-03-28T10:46:05.195657-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@712:
>>> Client environment:zookeeper.version=zookeeper C client 3.4.5
>>> 2014-03-28T10:46:05.195667-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@716:
>>> Client 
>>> environment:host.name<https://urldefense.proofpoint.com/v1/url?u=http://host.name&k=ppTtPKfaP99D%2F%2BB0F9qgBA%3D%3D%0A&r=x28Jl7CpmSGov0XieaueXQ%3D%3D%0A&m=4tTsrmnaFBZhAevZGQMu7MERiC%2BYL3QdI7iTMUIFqik%3D%0A&s=e22c1990921e7300f1890ae19be345c0374fbe0fee025f89892acefe879af22d>
>>> =tlandells-mesos02.dev.box.net.
>>> 2014-03-28T10:46:05.195671-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@723:
>>> Client 
>>> environment:os.name<https://urldefense.proofpoint.com/v1/url?u=http://os.name&k=ppTtPKfaP99D%2F%2BB0F9qgBA%3D%3D%0A&r=x28Jl7CpmSGov0XieaueXQ%3D%3D%0A&m=4tTsrmnaFBZhAevZGQMu7MERiC%2BYL3QdI7iTMUIFqik%3D%0A&s=0f5039fb6bfae82a50cc0ad2d01e809483cd318b3a96daf432d471f773a44430>
>>> =Linux
>>> 2014-03-28T10:46:05.195674-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@724:
>>> Client environment:os.arch=2.6.32-358.18.1.el6.x86_64
>>> 2014-03-28T10:46:05.195678-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.195552  6242 slave.cpp:111] Slave started on 1)@
>>> 10.3.18.184:5051<https://urldefense.proofpoint.com/v1/url?u=http://10.3.18.184:5051&k=ppTtPKfaP99D%2F%2BB0F9qgBA%3D%3D%0A&r=x28Jl7CpmSGov0XieaueXQ%3D%3D%0A&m=4tTsrmnaFBZhAevZGQMu7MERiC%2BYL3QdI7iTMUIFqik%3D%0A&s=56e0996f695897eda97193ede5e6c3bc983f8c41eb5d70f9c7c1bd3e9838cfa6>
>>> 2014-03-28T10:46:05.195681-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@725:
>>> Client environment:os.version=#1 SMP Tue Aug 27 14:23:09 CDT 2013
>>> 2014-03-28T10:46:05.195773-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.195708  6242 slave.cpp:211] Slave resources: cpus(*):2;
>>> mem(*):2808; disk(*):43161; ports(*):[31000-32000]
>>> 2014-03-28T10:46:05.196105-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@733:
>>> Client 
>>> environment:user.name<https://urldefense.proofpoint.com/v1/url?u=http://user.name&k=ppTtPKfaP99D%2F%2BB0F9qgBA%3D%3D%0A&r=x28Jl7CpmSGov0XieaueXQ%3D%3D%0A&m=4tTsrmnaFBZhAevZGQMu7MERiC%2BYL3QdI7iTMUIFqik%3D%0A&s=6c62d6ae4f01b70ca800eb1f50f3535de8fc467b57fdb6199cc7f1aad0789c99>
>>> =tlandells
>>> 2014-03-28T10:46:05.196115-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@741:
>>> Client environment:user.home=/root
>>> 2014-03-28T10:46:05.196119-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@log_env@753:
>>> Client environment:user.dir=/home/tlandells
>>> 2014-03-28T10:46:05.196122-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> 2014-03-28 10:46:05,195:6242(0x7fc5fafcb700):ZOO_INFO@zookeeper_init@786:
>>> Initiating client connection, host=tlandells-mesos01.dev.box.net:2181,
>>> tlandells-mesos02.dev.box.net:2181,tlandells-mesos03.dev.box.net:2181sessionTimeout=10000
>>>  watcher=0x7fc602fc8cb0 sessionId=0
>>> sessionPasswd=<null> context=0x7fc5d8003b90 flags=0
>>> 2014-03-28T10:46:05.197040-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> I0328 10:46:05.196420  6269 cgroups_isolator.cpp:223] Using /cgroup as
>>> cgroups hierarchy root
>>> 2014-03-28T10:46:05.197058-07:00 tlandells-mesos02 mesos-slave[6242]:
>>> Failed to mount cgroups hierarchy at '/cgroup' because we could not remove
>>> existing directory: Directory not empty
>>
>>
>> The command being invoked by /usr/bin/mesos-init-wrapper is:
>>
>>> /usr/local/sbin/mesos-slave --master=zk://
>>> tlandells-mesos01.dev.box.net:2181,tlandells-mesos02.dev.box.net:2181,
>>> tlandells-mesos03.dev.box.net:2181/mesos --log_dir=/var/log/mesos
>>> --isolation=cgroups
>>
>>
>> I haven't worked with cgroups before so I'm not clear if it's okay to
>> have Mesos using the default mounts or whether I should have something
>> special setup. I mean it's clear that Mesos wouldn't be able to empty out
>> /cgroup because it contains all the cgroup mounts...
>>
>> I saw a similar thread in the archives from about 6 months ago where the
>> issue was apparently a clash between Mesos and Docker, but I'm only trying
>> to run Mesos.
>>
>> The only thing that might be funky is that these are VMware instances I'm
>> currently using for testing.
>>
>> Does anyone have any experience running the cgroups isolator in SL (or
>> even RH, since it's basically just a rebadge) who can tell me what I'm
>> doing wrong?
>>
>> Thanks,
>> Tony
>>
>
>

Reply via email to