# qconf -se comp92
...
load_scaling NONE
complex_values slots=8,num_proc=8,h_vmem=32g
load_values arch=linux-x64,num_proc=8,mem_total=32108.289062M, \
swap_total=1999.992188M,virtual_total=34108.281250M, \
load_avg=1.000000,load_short=1.000000, \
load_medium=1.000000,load_long=1.000000, \
mem_free=30152.914062M,swap_free=1999.992188M, \
virtual_free=32152.906250M,mem_used=1955.375000M, \
swap_used=0.000000M,virtual_used=1955.375000M, \
cpu=12.500000,m_topology=SCTTSCTTSCTTSCTT, \
m_topology_inuse=SCTTSCTTSCTTSCTT,m_socket=4,m_core=4, \
np_load_avg=0.125000,np_load_short=0.125000, \
np_load_medium=0.125000,np_load_long=0.125000
...
...
On 04/09/2014 09:45 AM, Reuti wrote:
On 09.04.2014 at 15:43, Fan Dong wrote:
Here comes another question. comp92 is one of our exec nodes; it has 32GB of
memory. bigmem_16.q has 2 slots on this node. A bunch of jobs are submitted with
-l h_vmem=16G. For the first few hours both slots on comp92 were used just as
expected, until this morning I found the following numbers that I do not
understand. First off, only one slot on comp92 is used (the same is true for the
rest of our nodes similar to comp92). The h_vmem on comp92 shows 13.802G, and
that's why jobs are held in the queue: they request h_vmem=16G. What might be
the cause for this 'weird' h_vmem value?
This is taken from running 'qhost -F h_vmem':
HOSTNAME ARCH NCPU LOAD MEMTOT MEMUSE SWAPTO SWAPUS
comp92 linux-x64 8 1.00 31.4G 1.9G 2.0G 0.0
Host Resource(s): hc:h_vmem=13.802G
This is taken from running 'qstat -f -q bigmem_16.q'
queuename qtype resv/used/tot. load_avg arch states
---------------------------------------------------------------------------------
[email protected] BI 0/1/2 1.00 linux-x64
44946 0.55500 pipeline_w training r 04/09/2014 07:19:30 1
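For reference, the per-job request and the remaining consumable per queue
instance can be cross-checked like this (commands only; output omitted, since
the values would come from the live cluster):

# the h_vmem the running job actually requested (hard resource_list)
qstat -j 44946 | grep resource_list
# the remaining consumable h_vmem per queue instance, analogous to qhost -F
qstat -F h_vmem -q bigmem_16.q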
What was defined in `qconf -se comp92` for "h_vmem"?
-- Reuti
On 04/08/2014 12:13 PM, Reuti wrote:
On 08.04.2014 at 16:45, Fan Dong wrote:
On 04/08/2014 10:41 AM, Reuti wrote:
On 08.04.2014 at 16:34, Fan Dong wrote:
On 04/08/2014 10:17 AM, Reuti wrote:
On 08.04.2014 at 16:02, Fan Dong wrote:
Thanks for the help. I guess part of my original question was 'will h_vmem
help the scheduler to hold off the job if the node does not have enough h_vmem
left?'
Say, we have
• a consumable h_vmem (qconf -mc) with a default value of 4GB,
• the exec hosts h1 and h2 both have h_vmem = 32GB (qconf -me),
• the queue a.q is configured with 18GB h_vmem (qconf -mq).
What happens if a user sends 3 jobs to a.q, assuming there are more than two
slots on each of the hosts? Will
• 3 jobs get to run simultaneously?
Yes. 4 GB times 3 will fit into the available 32 GB.
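For this scenario the consumable definition from `qconf -mc` would look roughly
like the following line (illustrative, not copied from a real cluster):

#name     shortcut  type    relop  requestable  consumable  default  urgency
h_vmem    h_vmem    MEMORY  <=     YES          YES         4G       0

With that default, 3 jobs x 4 GB = 12 GB, which fits comfortably into the 32 GB
defined on either host.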
Then what is the use of the h_vmem setting in the queue? h_vmem has the value of
18GB in a.q; how does that come into play?
It is the maximum a user can request per job.
They get 4 GB by default, but they can request more - up to 18 GB for a
particular job. In case they request more than 18GB, the job will never start.
Nevertheless, the overall consumption of memory will be restricted by the
definition on the host level, i.e. that all jobs in total on an exechost will
never exceed 32 GB.
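Side by side, the two levels would look roughly like this (illustrative
excerpts):

# per-job ceiling, from the queue definition:
qconf -sq a.q | grep h_vmem
h_vmem                18G
# per-host pool, debited by all running jobs together:
qconf -se h1 | grep complex_values
complex_values        h_vmem=32G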
Excellent! But just to double-check: if a user does not explicitly use qsub
-l h_vmem in the submission script, the default 4GB will be used. Is that
correct?
As long as the "h_vmem" complex is assigned to each exechost with a suitable
value: yes.
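In qsub terms (job.sh is just a placeholder script, and the limits refer to the
scenario above):

# no -l h_vmem: the 4 GB default from qconf -mc is charged for the job
qsub job.sh
# explicit request, fine because it stays within the 18 GB queue limit
qsub -l h_vmem=12G job.sh
# request above the 18 GB queue limit: the job will never start in a.q
qsub -l h_vmem=20G job.sh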
-- Reuti
-- Reuti
Shouldn't the h_vmem in the queue override the default global consumable
value? You suggested earlier that the h_vmem attached to the queue is
enforced per job, but your calculation '4GB times 3' seems to ignore the h_vmem
in the queue. Could you please clarify? Thank you.
In case the user requests more memory, like 18 GB for each of them, it will be
different of course.
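Spelled out for three jobs each submitted with -l h_vmem=18G and two 32 GB hosts:

job 1 -> h1: 32 - 18 = 14 GB of consumable h_vmem left on h1
job 2 -> h2: 32 - 18 = 14 GB left on h2
job 3 -> needs 18 GB, but neither host has that much left, so it stays pending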
-- Reuti
• or does one job have to be held off? (because h_vmem on each of the hosts
would decrease to 32-18=14G, not enough for the third job)
On 04/07/2014 11:29 AM, Reuti wrote:
Hi,
On 07.04.2014 at 17:10, Fan Dong wrote:
I am a little confused about the consumable h_vmem setup on the node and the
queue. Let's say we have one queue, called a.q, that spans two hosts, h1 and h2.
h1 has 32GB of RAM and h2 has 128GB.
I attached h_vmem to both hosts, using the value of the actual physical RAM,
You defined this value `qconf -me ...` => "complex_values"?
also, a.q has an h_vmem value of 18GB, which is the peak memory usage of a job.
Yes, the setting in the queue is per job, while in the exechost definition it's
across all jobs.
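So for the setup described above, the exechost lines would look roughly like
this (illustrative, shown via `qconf -se`):

# qconf -se h1
complex_values        h_vmem=32G
# qconf -se h2
complex_values        h_vmem=128G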
Here is how I understand the way h_vmem works. When the first job in a.q is
sent to node h1, the h_vmem on the node will decrease to 32-18=14GB,
Did you make the "h_vmem" complex consumable in `qconf -mc`? What is the
default value specified there for it?
You check with `qhost -F h_vmem` and the values are not right?
the h_vmem attached to the queue will make sure that a job won't use more than
18GB of memory. When the second job comes in, it will be sent to node h2 because
there is not enough h_vmem left on node h1.
...as the value was subtracted on a host level.
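With the numbers from this example:

job 1 -> h1: 32 - 18 = 14 GB of consumable h_vmem left on h1
job 2 -> needs 18 GB, more than the 14 GB left on h1, so it is placed on h2:
         128 - 18 = 110 GB left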
I am not sure if I am correct about h_vmem, as I have the impression that h_vmem
won't stop jobs from being sent to a node, but that virtual_free does. Any
suggestions?
Keep in mind that "h_vmem" is a hard limit, while "virtual_free" is a hint for
SGE on how to distribute jobs while still allowing them to consume more than
requested. It depends on the workflow which fits best.
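As a rough illustration of the difference (job.sh is only a placeholder):

# hard limit: the job may not exceed 18 GB; with h_vmem consumable as above,
# 18 GB is also debited from the host's pool when the job is scheduled
qsub -l h_vmem=18G job.sh
# hint: only hosts currently reporting at least 18 GB virtual_free are
# considered, but nothing stops the job from growing beyond that afterwards
qsub -l virtual_free=18G job.sh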
-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users