Hi,

I have implemented the soft/hard limits on the MAXPROC parameter, as per the maui manual (maui version 3.2.6p13, torque 2.0.0p2), but I am seeing unexpected behavior:

- defined the parameter group-wise as follows
GROUPCFG[A]        MAXPROC=100,200
GROUPCFG[B]        MAXPROC=50,175
GROUPCFG[C]        MAXPROC=40,175
GROUPCFG[DEFAULT]  MAXPROC=30,175

- there are 5 cpus free and no jobs are in Q. Here is the group-wise processor usage (active procs used):
group A         8
group B         10
group C         57
group other1    61
group other2    5
group X         88

- A user from group X then submits several more jobs. Each of these jobs remains in Q state with checkjob reporting that "job 55059 violates active SOFT MAXPROC limit of 30 for group X". There are still 5 cpus free.

This does not seem normal. I would have expected that group X would be allowed to run jobs until they hit total procs=175, since there are no other competing jobs. Infact, the way I read the manual, this setup would allow groups 'other1', 'other2' and X to go upto max 175 procs each. Right?

Also, the diagnostic message from maui is misleading. The soft-limit of 30 has long been crossed !

Can anyone suggest what am I missing in this picture? What's the correlation between these numbers?

Thanks.

-Neel
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to