On Mon, 21 Jan 2013 02:28:16 -0700, Bjørn-Helge Mevik <[email protected]> wrote: > > Christopher Samuel <[email protected]> writes: > > As for letting cgroups notify the job instead of killing it, that is > probably hard to implement, because the cgroups limiting is done by the > kernel itself, not slurm, and I at least don't know of any > callback-hooks or other features in cgroups that could be used for such a > thing.
In upstream kernels there is already a feature for setting up memcg notifications, and the majority of this should be backported to RHEL as of RHEL6.4. http://www.mjmwired.net/kernel/Documentation/cgroups/memory.txt See Section 9 in the link above. An application can register for notification of when a usage_in_bytes threshold is crossed using the event_fd() mechanism. This can be useful for applications that have the ability to take some action when memory is tight (e.g. release some cache, shrink bufferes, etc.) Note that RHEL6.4 should also have the feature described in Sec 10, i.e. the ability to disable the oom killer for a memcg. mark > -- > Cheers, > Bjørn-Helge Mevik, dr. scient, > Research Computing Services, University of Oslo
