Hi Bill,

Here are a couple of ideas:
- At job end, compare each job's memory specification against its actual use and follow up with the offending users.
- Configure DefaultMemPerCPU and MaxMemPerCPU to match the CPU and memory allocations (e.g. if a node has 8 CPUs and 8GB, set both parameters to 1G). Then if someone requests all of the memory on a node, they will be allocated all of its CPUs as well.
- In a job_submit plugin, set a nice value for jobs requesting a lot of memory (i.e. lower their scheduling priority); a rough sketch is below.
- You could configure MaxMemPerNode, but that would probably impact users who really need a lot of memory.
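A rough sketch of the job_submit approach (the 6GB threshold and the nice value are placeholders, and the job_desc field that holds --mem varies by release: pn_min_memory on older ones, min_mem_per_node on newer ones, so check the example job_submit.lua shipped with your Slurm version):

    -- job_submit.lua sketch: lower the scheduling priority of jobs that
    -- request a lot of memory per node. Field names and sentinels below
    -- are assumptions; verify them against your release's job_submit.lua.
    BIG_MEM_MB   = 6 * 1024   -- site-specific cutoff for "big memory"
    NICE_PENALTY = 1000       -- nice value applied to such jobs

    function slurm_job_submit(job_desc, part_list, submit_uid)
       local mem = job_desc.min_mem_per_node  -- --mem in MB; nil/NO_VAL64 if unset
       if mem ~= nil and mem ~= slurm.NO_VAL64 and mem > BIG_MEM_MB then
          job_desc.nice = NICE_PENALTY
          slurm.log_info("job_submit: uid " .. submit_uid .. " requested " ..
                         mem .. " MB/node, setting nice=" .. NICE_PENALTY)
       end
       return slurm.SUCCESS
    end

    function slurm_job_modify(job_desc, job_rec, part_list, modify_uid)
       return slurm.SUCCESS
    end

The same hook could return an error instead of adjusting nice if you decide to reject such jobs outright.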

Moe Jette
SchedMD

Quoting Bill Wichser <[email protected]>:

In doing accounting on past jobs, we are trying to figure out how to account for memory usage as well as core usage. What began as an anomaly has now turned into something my users have found works quite effectively for their jobs: adding the line

#SBATCH --mem=MaxMemPerNode

We do share our nodes, so this is an unacceptable specification.

Before going down the path of adding yet another check to the job_submit.lua script, I am wondering if there isn't a better way. Currently I do not have this value configured, so "scontrol show config" reports it as UNLIMITED, which is not at all what I want. Ideally I'd set it to some small value, but I suspect that would have repercussions later on, when users who really do need a lot of memory request an amount that exceeds the low MaxMemPerNode value I'd set.


Yes, I could just inform my users that this is unacceptable behavior. But we all know that without policing it will arise again, so I'd much rather deal with this once and for all, either by adding the "right" value to slurm.conf or by rejecting jobs that use this variable altogether.

Thanks,
Bill
