Hi,
I'm having a hard time configuring a simple queue with more priority than
the other. I may be no seeing something really big, so I ask for your help.
Queue "rapida" must have, always, more priority than queue "default", which
is the default queue. I've configured maui and, if I submit several test
jobs, I seem to get the priorities right:
Job PRIORITY* Cred( User:Group:Class) Serv(QTime)
Weights -------- 1( 1: 1: 10) 1( 1)
6741 100010 100.0( 0.0: 0.0:10000) 0.0( 10.1)
6756 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6757 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6758 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6759 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6760 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6761 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6762 100003 100.0( 0.0: 0.0:10000) 0.0( 2.8)
6763 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6764 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6765 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6766 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6767 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6768 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
6769 13 78.4( 0.0: 0.0: 1.0) 21.6( 2.8)
But a minute later, the ones with priority 13 get scheduled to run!
Job PRIORITY* Cred( User:Group:Class) Serv(QTime)
Weights -------- 1( 1: 1: 10) 1( 1)
6741 100010 100.0( 0.0: 0.0:10000) 0.0( 10.4)
6756 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6757 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6758 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6759 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6760 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6761 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6762 100003 100.0( 0.0: 0.0:10000) 0.0( 3.0)
6766 13 76.6( 0.0: 0.0: 1.0) 23.4( 3.0)
6767 13 76.6( 0.0: 0.0: 1.0) 23.4( 3.0)
6768 13 76.6( 0.0: 0.0: 1.0) 23.4( 3.0)
6769 13 76.6( 0.0: 0.0: 1.0) 23.4( 3.0)
Percent Contribution -------- 100.0( 0.0: 0.0:100.0) 0.0( 0.0)
Some time later, 6741 (priority: 100010) started.
So it seems maui is scheduling some jobs from one queue, and some from the
other, when it should just use the priority and schedule all the ones that
belong to the "rapida" queue first.
I ran into the output of "diagnose -c"
Class/Queue Status
Name Priority Flags QDef QOSList* PartitionList Target Limits
rapida 10000 [NONE] [NONE] [NONE] [NONE] 0.00 MAXJOB=2:100000
MAXJOBPERUSER=100000,100000
default 1 [NONE] [NONE] [NONE] [NONE] 0.00 [NONE]
What does that "MAXJOB=2:100000" means? I can't find MAXJOB defined anywhere
in maui.cfg nor in showconfig, nor in qmgr's "p s".
My maui.cfg looks like this:
----------------------------------------
RMPOLLINTERVAL 00:00:30
SERVERHOST bose.fisica.unlp.edu.ar
SERVERPORT 42559
SERVERMODE NORMAL
RMCFG[base] TYPE=PBS
# Admin: http://supercluster.org/mauidocs/a.esecurity.html
# ADMIN1 users have full scheduler control
ADMIN1 maui root
LOGFILE maui.log
LOGFILEMAXSIZE 10000000
LOGLEVEL 3
# Job Priority: http://supercluster.org/mauidocs/5.1jobprioritization.html
QUEUETIMEWEIGHT 1
# Throttling Policies:
http://supercluster.org/mauidocs/6.2throttlingpolicies.html
# NONE SPECIFIED
# Backfill: http://supercluster.org/mauidocs/8.2backfill.html
BACKFILLPOLICY FIRSTFIT
RESERVATIONPOLICY CURRENTHIGHEST
# Node Allocation: http://supercluster.org/mauidocs/5.2nodeallocation.html
#NODEALLOCATIONPOLICY MINRESOURCE
NODEALLOCATIONPOLICY PRIORITY
# QOS: http://supercluster.org/mauidocs/7.3qos.html
CLASSCFG[rapida] PRIORITY=10000 MAXJOB=8
CLASSCFG[default] PRIORITY=1
CLASSWEIGHT 10
CREDWEIGHT 1
USERWEIGHT 1
GROUPWEIGHT 1
# Standing Reservations:
http://supercluster.org/mauidocs/7.1.3standingreservations.html
# SRSTARTTIME[test] 8:00:00
# SRENDTIME[test] 17:00:00
# SRDAYS[test] MON TUE WED THU FRI
# SRTASKCOUNT[test] 20
# SRMAXTIME[test] 0:30:00
# Creds: http://supercluster.org/mauidocs/6.1fairnessoverview.html
# USERCFG[DEFAULT] FSTARGET=25.0
# USERCFG[john] PRIORITY=100 FSTARGET=10.0-
# GROUPCFG[staff] PRIORITY=1000 QLIST=hi:low QDEF=hi
# CLASSCFG[batch] FLAGS=PREEMPTEE
# CLASSCFG[interactive] FLAGS=PREEMPTOR
------------------------
schedctl -l returns:
-------------------------------------
# Maui version 3.2.6p19 (PID: 26905)
# global policies
REJECTNEGPRIOJOBS[0] FALSE
ENABLENEGJOBPRIORITY[0] FALSE
ENABLEMULTINODEJOBS[0] TRUE
ENABLEMULTIREQJOBS[0] FALSE
BFPRIORITYPOLICY[0] [NONE]
JOBPRIOACCRUALPOLICY QUEUEPOLICY
NODELOADPOLICY ADJUSTSTATE
USEMACHINESPEED FALSE
USESYSTEMQUEUETIME TRUE
USELOCALMACHINEPRIORITY FALSE
NODEUNTRACKEDLOADFACTOR 1.2
JOBNODEMATCHPOLICY[0]
JOBMAXSTARTTIME[0] INFINITY
METAMAXTASKS[0] 0
NODESETPOLICY[0] [NONE]
NODESETATTRIBUTE[0] [NONE]
NODESETLIST[0]
NODESETDELAY[0] 00:00:00
NODESETPRIORITYTYPE[0] MINLOSS
NODESETTOLERANCE[0] 0.00
BACKFILLPOLICY[0] FIRSTFIT
BACKFILLDEPTH[0] 0
BACKFILLPROCFACTOR[0] 0
BACKFILLMAXSCHEDULES[0] 10000
BACKFILLMETRIC[0] PROCS
BFCHUNKDURATION[0] 00:00:00
BFCHUNKSIZE[0] 0
PREEMPTPOLICY[0] REQUEUE
MINADMINSTIME[0] 00:00:00
RESOURCELIMITPOLICY[0]
NODEAVAILABILITYPOLICY[0] COMBINED:[DEFAULT]
NODEALLOCATIONPOLICY[0] PRIORITY
TASKDISTRIBUTIONPOLICY[0] DEFAULT
RESERVATIONPOLICY[0] CURRENTHIGHEST
RESERVATIONRETRYTIME[0] 00:00:00
RESERVATIONTHRESHOLDTYPE[0] NONE
RESERVATIONTHRESHOLDVALUE[0] 0
FSPOLICY [NONE]
FSPOLICY [NONE]
FSINTERVAL 12:00:00
FSDEPTH 8
FSDECAY 1.00
# Priority Weights
SERVICEWEIGHT[0] 1
TARGETWEIGHT[0] 1
CREDWEIGHT[0] 1
ATTRWEIGHT[0] 1
FSWEIGHT[0] 1
RESWEIGHT[0] 1
USAGEWEIGHT[0] 1
QUEUETIMEWEIGHT[0] 1
XFACTORWEIGHT[0] 0
SPVIOLATIONWEIGHT[0] 0
BYPASSWEIGHT[0] 0
TARGETQUEUETIMEWEIGHT[0] 0
TARGETXFACTORWEIGHT[0] 0
USERWEIGHT[0] 1
GROUPWEIGHT[0] 1
ACCOUNTWEIGHT[0] 0
QOSWEIGHT[0] 0
CLASSWEIGHT[0] 10
FSUSERWEIGHT[0] 0
FSGROUPWEIGHT[0] 0
FSACCOUNTWEIGHT[0] 0
FSQOSWEIGHT[0] 0
FSCLASSWEIGHT[0] 0
ATTRATTRWEIGHT[0] 0
ATTRSTATEWEIGHT[0] 0
NODEWEIGHT[0] 0
PROCWEIGHT[0] 0
MEMWEIGHT[0] 0
SWAPWEIGHT[0] 0
DISKWEIGHT[0] 0
PSWEIGHT[0] 0
PEWEIGHT[0] 0
WALLTIMEWEIGHT[0] 0
UPROCWEIGHT[0] 0
UJOBWEIGHT[0] 0
CONSUMEDWEIGHT[0] 0
USAGEEXECUTIONTIMEWEIGHT[0] 0
REMAININGWEIGHT[0] 0
PERCENTWEIGHT[0] 0
XFMINWCLIMIT[0] 00:02:00
# partition DEFAULT policies
REJECTNEGPRIOJOBS[1] FALSE
ENABLENEGJOBPRIORITY[1] FALSE
ENABLEMULTINODEJOBS[1] TRUE
ENABLEMULTIREQJOBS[1] FALSE
BFPRIORITYPOLICY[1] [NONE]
JOBPRIOACCRUALPOLICY QUEUEPOLICY
NODELOADPOLICY ADJUSTSTATE
JOBNODEMATCHPOLICY[1]
JOBMAXSTARTTIME[1] INFINITY
METAMAXTASKS[1] 0
NODESETPOLICY[1] [NONE]
NODESETATTRIBUTE[1] [NONE]
NODESETLIST[1]
NODESETDELAY[1] 00:00:00
NODESETPRIORITYTYPE[1] MINLOSS
NODESETTOLERANCE[1] 0.00
# Priority Weights
XFMINWCLIMIT[1] 00:00:00
RMAUTHTYPE[0] CHECKSUM
CLASSCFG[[NONE]] DEFAULT.FEATURES=[NONE]
CLASSCFG[[ALL]] DEFAULT.FEATURES=[NONE]
CLASSCFG[rapida] DEFAULT.FEATURES=[NONE]
CLASSCFG[default] DEFAULT.FEATURES=[NONE]
QOSPRIORITY[0] 0
QOSQTWEIGHT[0] 0
QOSXFWEIGHT[0] 0
QOSTARGETXF[0] 0.00
QOSTARGETQT[0] 00:00:00
QOSFLAGS[0]
QOSPRIORITY[1] 0
QOSQTWEIGHT[1] 0
QOSXFWEIGHT[1] 0
QOSTARGETXF[1] 0.00
QOSTARGETQT[1] 00:00:00
QOSFLAGS[1]
# SERVER MODULES: MX
SERVERMODE NORMAL
SERVERNAME
SERVERHOST bose.fisica.unlp.edu.ar
SERVERPORT 42559
LOGFILE maui.log
LOGFILEMAXSIZE 10000000
LOGFILEROLLDEPTH 1
LOGLEVEL 3
LOGFACILITY fALL
SERVERHOMEDIR /opt/maui/
TOOLSDIR /opt/maui/tools/
LOGDIR /opt/maui/log/
STATDIR /opt/maui/stats/
LOCKFILE /opt/maui/maui.pid
SERVERCONFIGFILE /opt/maui/maui.cfg
CHECKPOINTFILE /opt/maui/maui.ck
CHECKPOINTINTERVAL 00:05:00
CHECKPOINTEXPIRATIONTIME 3:11:20:00
TRAPJOB
TRAPNODE
TRAPFUNCTION
RESDEPTH 24
RMPOLLINTERVAL 00:00:30
NODEACCESSPOLICY SHARED
ALLOCLOCALITYPOLICY [NONE]
SIMTIMEPOLICY [NONE]
ADMIN1 maui root
ADMINHOSTS ALL
NODEPOLLFREQUENCY 0
DISPLAYFLAGS
DEFAULTDOMAIN .local
DEFAULTCLASSLIST [DEFAULT:1]
FEATURENODETYPEHEADER
FEATUREPROCSPEEDHEADER
FEATUREPARTITIONHEADER
DEFERTIME 1:00:00
DEFERCOUNT 24
DEFERSTARTCOUNT 1
JOBPURGETIME 0
NODEPURGETIME 2140000000
APIFAILURETHRESHHOLD 6
NODESYNCTIME 600
JOBSYNCTIME 600
JOBMAXOVERRUN 00:10:00
NODEMAXLOAD 0.0
PLOTMINTIME 120
PLOTMAXTIME 245760
PLOTTIMESCALE 11
PLOTMINPROC 1
PLOTMAXPROC 512
PLOTPROCSCALE 9
SCHEDCFG[] MODE=NORMAL
SERVER=bose.fisica.unlp.edu.ar:42559
# RM MODULES: PBS SSS WIKI NATIVE
RMCFG[base] AUTHTYPE=CHECKSUM EPORT=15004 TIMEOUT=00:00:09 TYPE=PBS
SIMWORKLOADTRACEFILE workload
SIMRESOURCETRACEFILE resource
SIMAUTOSHUTDOWN OFF
SIMSTARTTIME 0
SIMSCALEJOBRUNTIME FALSE
SIMFLAGS
SIMJOBSUBMISSIONPOLICY CONSTANTJOBDEPTH
SIMINITIALQUEUEDEPTH 16
SIMWCACCURACY 0.00
SIMWCACCURACYCHANGE 0.00
SIMNODECOUNT 0
SIMNODECONFIGURATION NORMAL
SIMWCSCALINGPERCENT 100
SIMCOMRATE 0.10
SIMCOMTYPE ROUNDROBIN
COMINTRAFRAMECOST 0.30
COMINTERFRAMECOST 0.30
SIMSTOPITERATION -1
SIMEXITITERATION -1
-------------------------------------------------------
Thanks in advance
-- Diego.
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers