[
https://issues.apache.org/jira/browse/MESOS-3156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
haosdent reassigned MESOS-3156:
-------------------------------
Assignee: haosdent
> Inconsistency between mesos master UI and mesos slave /metrics/snapshot
> -----------------------------------------------------------------------
>
> Key: MESOS-3156
> URL: https://issues.apache.org/jira/browse/MESOS-3156
> Project: Mesos
> Issue Type: Bug
> Components: master, slave, statistics, webui
> Affects Versions: 0.22.1
> Environment: Test environment runs on vagrant
> Master: Centos 7 + mesos 0.22.1 + marathon 0.9.0 = 1 vcpu + 1gb ram
> Slave: Centos 7 + mesos 0.22.1 = 3vcpus + 2048mb
> (1 master + 1 slave)
> Reporter: Sebastian Otaegui
> Assignee: haosdent
>
> We recently began doing some tests with kibana to graph some of the stats of
> the slaves and the masters.
> We found something pretty odd:
> Test case:
> In my example my slave has 1840 mb free, of which mesos reserves 920mb for
> tasks.
> 1. create N (in my case 14) marathon tasks with the following configuration
> {noformat}
> command: while true; do sleep 1 ; echo "heloo"; done
> mem: 64mb
> cpu: 0.1
> {noformat}
> 2. check the mesos master web UI
> {noformat}
> Total 3 920 MB
> Used 1.4 896 MB
> {noformat}
> 3. check the <slave host>:5051/metrics/snapshot
> {noformat}
> "slave/mem_total":920,
> "slave/mem_used”:1344
> {noformat}
> Is this correct? I discussed this issue on the DCOS community slack channel
> with Adam and he told me that the correct numbers are in the #3 he explained
> that for each task, there are about 32mb + 0.1 cpu that is assigned to a
> default executor.
> I also changed the slave to enable cgroups_limit_swap:
> {noformat}
> /etc/mesos-slave/
> ├── attributes
> ├── cgroups_limit_swap
> ├── containerizers
> ├── executor_registration_timeout
> ├── hostname
> ├── isolation
> ├── resources
> └── work_dir
> {noformat}
> {noformat}
> cat /etc/mesos-slave/cgroups_limit_swap
> true
> {noformat}
> {noformat}
> ps ax | grep slave
> 26810 ? Ssl 0:02 /usr/sbin/mesos-slave
> --master=zk://172.41.5.11:2181/mesos --ip=172.41.6.11
> --cgroups_limit_swap=true --containerizers=docker,mesos
> --executor_registration_timeout=30mins --hostname=172.41.6.11
> --isolation=cgroups/cpu,cgroups/mem --work_dir=/tmp/mesos
> {noformat}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)