On Wed, 23 Oct 2013 10:06:12 +0200
Reuti wrote:

> Hi,

Hi Reuti,

> > # qconf -sc|egrep 'virtual_free|h_vmem|^#'
> > #name          shortcut  type    relop  requestable  consumable  default  urgency
> > #------------------------------------------------------------------------------------------
> > h_vmem         h_vmem    MEMORY  <=     YES          JOB         0        0
> > virtual_free   vf        MEMORY  <=     YES          JOB         0        0
> >
> >
> > yesterday I found a parallel job that asked for 64GB of h_vmem that
> > was using more than 100GB of mem but SGE did not kill it :
>
> More than 100G in total or per slot (as the limit is multiplied)?

??
from sge_complex:

  A consumable defined by 'y' is a per slot consumables which means the
  limit is multiplied by the number of slots being used by the job
  before being applied. In case of 'j' the consumable is a per job
  consumable.

doesn't "JOB" mean per job total?

> > # qstat -j 2098938|grep vmem
> > hard resource_list:         virtual_free=64G,h_vmem=64G,h_rt=172800
> > usage    1:                 cpu=18:26:24, mem=111455.48587 GBs,
> > io=1735.61545, vmem=196.038G, maxvmem=197.132G
>
> Can you please `grep` the messages file for the executing node for
> other entries of job "2098938".

# ls
active_jobs  job_scripts           messages-20130630.gz  messages-20130721.gz  messages-20130811.gz  messages-20130901.gz  messages-20130922.gz  messages-20131013.gz
execd.pid    messages              messages-20130707.gz  messages-20130728.gz  messages-20130818.gz  messages-20130908.gz  messages-20130929.gz  messages-20131020.gz
jobs         messages-20130623.gz  messages-20130714.gz  messages-20130804.gz  messages-20130825.gz  messages-20130915.gz  messages-20131006.gz
# zgrep 2098938 messages*
#

there are no entries for that job....

> -- Reuti

Thanks,
Arnau
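
For reference, the sge_complex wording quoted above can be written out as a small calculation. This is only a sketch of that one sentence, not of how sge_execd actually enforces h_vmem. The function name and the slot count are hypothetical, since the thread does not say how many slots job 2098938 used.

```python
def booked_amount_gb(request_gb, slots, consumable):
    """Amount counted against a consumable, per the sge_complex text quoted above.

    'YES' (y): per-slot consumable, the request is multiplied by the slot count.
    'JOB' (j): per-job consumable, the request is counted once for the whole job.
    """
    kind = consumable.upper()
    if kind in ("YES", "Y"):
        return request_gb * slots
    if kind in ("JOB", "J"):
        return request_gb
    raise ValueError("unsupported consumable setting: %r" % consumable)


# Hypothetical slot count, chosen only for illustration.
SLOTS = 4

print(booked_amount_gb(64, SLOTS, "YES"))  # 256
print(booked_amount_gb(64, SLOTS, "JOB"))  # 64
```

With the complex configured as JOB, as in the qconf output, this reading gives 64G for the whole job regardless of slot count, which is the interpretation the question "doesn't JOB mean per job total?" is asking about.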
