Hello, we had the same concern a few months ago when trying to address issues encountered with cfs bandwidth mechanism. Eventually, we ended up implementing our own module to use cpuset cgroups for our task using our generic isolator mechanism (https://github.com/criteo/mesos-command-modules/). Our observation is that using cpusets instead of cfs bandwidth reduced cpu consumption from 4-10% without any loss in performance. We are also currently working on leveraging numa topology (in a very simple manner) to better allocate cpus to tasks.
Would be happy to discuss with other users having the same kind of challenges! -- Grégoire ________________________________ From: Abel Souza <[email protected]> Sent: Monday, October 21, 2019 4:41 PM To: user <[email protected]> Subject: [BULK]Task Pinning Hi, Does anyone know if pinning capabilities will ever be available to Mesos? Someone registered an issue at Jira (https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fissues.apache.org%2Fjira%2Fbrowse%2FMESOS-5342&data=02%7C01%7Cg.seux%40criteo.com%7Ca1f9cfe9118f400d6d6f08d75634ccbd%7C2a35d8fd574d48e3927c8c398e225a01%7C1%7C1%7C637072657076867754&sdata=SqTxAjWI3Ea%2BDdydhQEEAGWmHrO5bXn3kPg%2Bpconk9c%3D&reserved=0), started an implementation (https://eur03.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fct-clmsn%2Fmesos-cpusets&data=02%7C01%7Cg.seux%40criteo.com%7Ca1f9cfe9118f400d6d6f08d75634ccbd%7C2a35d8fd574d48e3927c8c398e225a01%7C1%7C1%7C637072657076867754&sdata=Xy0utL9uI%2FOA5r%2B5gB6Q%2BWKjpwnbbozZA%2FVorIFU98w%3D&reserved=0), but apparently it never went through mainline. I successfully compiled it in my testbed and loaded it into the Mesos master agent, but it keeps crashing the master during the submission process. So before moving on into potential fixes to these crashes, I would like to know if someone knows about possible updates to this specific capability in future Mesos releases. Thank you, /Abel

