On Friday, 9 November 2018 2:16:48 PM AEDT Noam Bernstein wrote:
> Can anyone shed some light on where the _virtual_ memory limit comes from?
>
> We're getting jobs killed with the message
> slurmstepd: error: Step 3664.0 exceeded virtual memory limit (79348101120 > 72638634393), being killed
>
> Is this a limit that's dictated by cgroup.conf
It's not cgroups. Cgroup limits are enforced by the kernel, whereas this message comes from Slurm itself monitoring the job, deciding it has used too much memory, and killing it.

All the best,
Chris
-- 
Chris Samuel : http://www.csamuel.org/ : Melbourne, VIC
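
For anyone who wants to see how far over the limit a step went, here is a minimal sketch that pulls the two numbers out of the slurmstepd message quoted above. The function name `parse_vmem_kill` is hypothetical, the regular expression assumes exactly the message shape shown in this thread, and treating the values as byte counts is an assumption rather than something confirmed here.

```python
import re

# Matches only the message shape quoted in this thread (an assumption;
# other Slurm versions may word it differently).
LIMIT_RE = re.compile(
    r"Step (?P<step>\d+\.\d+) exceeded virtual memory limit "
    r"\((?P<used>\d+) > (?P<limit>\d+)\)"
)


def parse_vmem_kill(line):
    """Return (step, used, limit), or None if the line does not match."""
    m = LIMIT_RE.search(line)
    if m is None:
        return None
    return m.group("step"), int(m.group("used")), int(m.group("limit"))


msg = ("slurmstepd: error: Step 3664.0 exceeded virtual memory limit "
       "(79348101120 > 72638634393), being killed")

step, used, limit = parse_vmem_kill(msg)

# Assumes the values are bytes; adjust the divisor if they are not.
print(f"step {step}: used {used / 2**30:.1f} GiB, "
      f"limit {limit / 2**30:.1f} GiB, "
      f"over by {100 * (used - limit) / limit:.1f}%")
```

Running this over a slurmd log can show whether steps are only slightly above the limit or far beyond it, which helps when deciding whether the limit or the job is the thing to change.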