I have read through, to the best of my ability in
last couple days, the Grid Engine archives and
BioTeam's slides on various aspects of SGE, but
I can't seem to get RQS set up to do what I need:
Accept as many jobs (serial & parallel) as the
user submits but run a maximum of 10 jobs per
user at any given time OR let that user use a
maximum of 64 slots (used by 10 or less active
jobs) at any given time
Here's what I have done so far:
1. qconf -mconf
max_u_jobs 10
2. Added a rule in RQS
{
name max_slots_per_user
description Limit no. of active slots per user
enabled TRUE
limit users {*} to slots=64
}
Problems I have run into:
1. With #1 (max_u_jobs 10), SGE won't even accept
the 11th job user submits to the queue
2. I'm confused a bit with the terms 'slots' and
'jobs' (BioTeam slides treat slots as jobs).
To get some clarification, I set max_u_jobs
to 0 (unlimited) and modified the RQS rule
to limit the maximum slots to 5.
I then submitted 10 instances of 'Hello, World!'
(MPI) using 2 processors each and 10 instances
of 'sleeper.sh' using 1 processor each.
What I noticed is that the number of slots from
active jobs summed up to 5, irrespective of the
number of active jobs.
So, I do understand now that the term 'jobs' in
BioTeam slides actually mean 'slots'.
Help please? Any guidance towards further material
to read up on (man pages, websites, etc.) will be
greatly appreciated.
Best,
g
--
Gowtham
Information Technology Services
Michigan Technological University
(906) 487/3593
http://www.it.mtu.edu/
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users