On Wed, 23 Oct 2013 10:06:12 +0200
Reuti wrote:

> Hi,
Hi Reuti,
 
> > # qconf -sc|egrep 'virtual_free|h_vmem|^#'
> > #name               shortcut     type        relop requestable consumable default  urgency
> > #------------------------------------------------------------------------------------------
> > h_vmem              h_vmem       MEMORY      <=    YES         JOB        0        0
> > virtual_free        vf           MEMORY      <=    YES         JOB        0        0
> > 
> > 
> > Yesterday I found a parallel job that asked for 64GB of h_vmem and
> > was using more than 100GB of memory, but SGE did not kill it:
> 
> More than 100G in total or per slot (as the limit is multiplied)?
?? 

from sge_complex:

A  consumable  defined  by ’y’ is a per slot consumables which means
the limit is multiplied by the number of slots being used by the job
before being applied.  In case of ’j’ the consumable is a per job
consumable.

Doesn't "JOB" mean a per-job total?
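
To spell out how I read the sge_complex text quoted above, here is a minimal shell sketch. The function name effective_limit and the slot count of 8 are made up for illustration; only the 64G request comes from the job in question. It reflects the man page wording only, not necessarily what execd enforces in practice.

```shell
# Hypothetical helper: effective_limit <consumable> <request_gb> <slots>
# Mirrors the sge_complex wording: 'y' (YES) is per slot, 'j' (JOB) is per job.
effective_limit() {
  case "$1" in
    YES|y) echo $(( $2 * $3 )) ;;   # per slot: request multiplied by slots
    JOB|j) echo "$2" ;;             # per job: request applied as-is
    *) echo "unknown consumable setting: $1" >&2; return 1 ;;
  esac
}

# Assumed example: a 64G request on 8 slots (the slot count is invented).
effective_limit JOB 64 8   # prints 64
effective_limit YES 64 8   # prints 512
```

Under that reading, a JOB consumable with h_vmem=64G should cap the whole job at 64G regardless of how many slots it has.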
 
 
> > # qstat -j 2098938|grep vmem
> > hard resource_list:         virtual_free=64G,h_vmem=64G,h_rt=172800
> > usage    1:                 cpu=18:26:24, mem=111455.48587 GBs, io=1735.61545, vmem=196.038G, maxvmem=197.132G
> 
> Can you please `grep` the messages file for the executing node for
> other entries of job "2098938".
 
# ls
active_jobs  job_scripts           messages-20130630.gz  messages-20130721.gz  messages-20130811.gz  messages-20130901.gz  messages-20130922.gz  messages-20131013.gz
execd.pid    messages              messages-20130707.gz  messages-20130728.gz  messages-20130818.gz  messages-20130908.gz  messages-20130929.gz  messages-20131020.gz
jobs         messages-20130623.gz  messages-20130714.gz  messages-20130804.gz  messages-20130825.gz  messages-20130915.gz  messages-20131006.gz
# zgrep 2098938 messages*
# 

There are no entries for that job in any of the messages files.

> -- Reuti
Thanks,
Arnau

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
