{Def,Max}MemPer{CPU,Node} seem used to avoid over-subscribing memory and 
causing paging. Although I read slurm.conf, I have further questions about them.

 

1.     MaxPerCPU and MaxPerNode look for setting maximum available memory 
sizes. What exactly DefPerCPU and DefPerNode mean?

2.     What happens if a single process requests more memory larger than the 
specification? How would it work differently for each specified values?

3.     If each node has a different memory size, how can I set different 
DefMemPerNode or MaxMemPerNode value for each node?

4.     Why would {Def,Max}MemPerCPU generally be used with select/cons_res and 
{def,Max}MemPerNode generally be used with select/linear? Would it be okay to 
use {def,Max}MemPerNode with select/cons_res?

 

Thank you.

 

Joo-Kyung

 

From: David Bigagli 
[http://lists.schedmd.com/cgi-bin/dada/mail.cgi/r/slurmdev/577441466023/] 
Sent: Tuesday, March 12, 2013 12:03 PM
To: slurm-dev
Subject: [slurm-dev] Re: Memory swapping, and transition delay issues.

 

Hi, the problem of memory over-subscription is discusses in 'man slurm.conf'. 
Have a look at 

DefMemPerCPU, DefMemPerNode and the suggested configuration when using 
CR_CPU_Memory.




/David

 

On Tue, Mar 12, 2013 at 3:15 PM, Joo-Kyung Kim <[email protected] 
<http://lists.schedmd.com/cgi-bin/dada/mail.cgi/r/slurmdev/690317113748/> > 
wrote:

Hi,

 

I am using SLURM 2.4.0.

 

When I run many memory intensive processes at the same time using a slurm job, 
there are a lot of memory swappings by kswapd0.

 

I changed SelectTypeParameters from CR_CPU to CR_CPU_MEMORY (SchedulerType = 
sched/backfill and SelectType = select/cons_res aren’t changed), I am not sure 
there would be some significant reduction of memory swapping.

 

Would there be a good way to automatically reduce the high memory swapping?

 

 

Another question: It seems that it takes more than 15 seconds for completing 
jobs to be completed jobs so that the next jobs can be runned on a cluster with 
8 nodes.

 

I wonder it’s normal that the transition takes some time or would there be any 
problem?

 

Thank you.

 

Joo-Kyung

  <http://lists.schedmd.com/cgi-bin/dada/mail.cgi/r/slurmdev/449583192704/> 

 

  <http://lists.schedmd.com/cgi-bin/dada/mail.cgi/r/slurmdev/858117913508/> 

Reply via email to