On Tue, 22 May 2012, Rayson Ho wrote:
For those who missed the Gompute User Group Meeting:
http://blogs.scalablelogic.com/2012/05/grid-engine-cgroups-integration.html
As most of the users are running Linux, it is now time to use a more
modern mechanism for the PDC to track process-job membership. We will
further enhance the cgroups integration beyond the Grid Engine 2011.11
update 1 release - eg. we are planning to support device whitelisting
in a future update.
Rayson
P.S. We will create a series of blog postings for the OGS/GE 2011.11
update 1 new features.
Hi Rayson,
I couldn't agree more, the existing mechanisms are extremely deficient.
h_vmem was never a perfect proxy, but in the 64-bit world it's extremely
poor.
I didn't want to mention it yet, as I'm still knee-deep in qmaster guts,
but I'm working on a patchset to make use of the memory cgroup controller.
The intention was to start with that only, as it's the most urgent cgroup
addition, but in a way that would hopefully allow easy extension to others
as appropriate.
Intended notable features of the patchset:
* Two new resources h_mem and s_mem to limit total memory + swap usage
(i.e. not just rss).
* Supplements the existing PDC mechanisms, to allow continued support of
operating systems that do not have cgroups (or equiv mechanism), or
installations who want to use h_vmem. In those instances, the intention is
that the jobs are killed when they exceed their request, but are not
otherwise enforced.
How far along with your solution are you? Am I just duplicating work
someone else has already done?
Mark
--
-----------------------------------------------------------------
Mark Dixon Email : [email protected]
HPC/Grid Systems Support Tel (int): 35429
Information Systems Services Tel (ext): +44(0)113 343 5429
University of Leeds, LS2 9JT, UK
-----------------------------------------------------------------
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users