Over the last couple of days I have read through,
to the best of my ability, the Grid Engine archives
and BioTeam's slides on various aspects of SGE, but
I can't seem to get RQS (resource quota sets) set up
to do what I need:


  Accept as many jobs (serial & parallel) as the
  user submits but run a maximum of 10 jobs per
  user at any given time OR let that user use a
  maximum of 64 slots (used by 10 or fewer active
  jobs) at any given time
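
To spell out the policy I am after, here is a minimal
Python sketch. It is only a toy model, not Grid Engine
code: the function name may_start and the two constants
are my own illustrative names, and it assumes both caps
apply together, which is one reading of the requirement
above.

```python
# Toy model of the desired per-user policy; not Grid Engine code.
MAX_RUNNING_JOBS = 10   # cap on running jobs per user (from the requirement)
MAX_RUNNING_SLOTS = 64  # cap on slots in use per user (from the requirement)


def may_start(running_jobs, running_slots, job_slots):
    """Return True if a pending job asking for job_slots may start now.

    Submission is never refused in this model; a job that may not
    start simply stays pending until a running job finishes.
    """
    within_job_cap = running_jobs + 1 <= MAX_RUNNING_JOBS
    within_slot_cap = running_slots + job_slots <= MAX_RUNNING_SLOTS
    return within_job_cap and within_slot_cap


print(may_start(9, 40, 8))   # would be the 10th running job, 48 slots total
print(may_start(10, 10, 1))  # would be an 11th running job
print(may_start(3, 60, 8))   # would need 68 slots
```

Only the first call should be allowed to start; the other
two jobs would wait in the queue rather than be rejected.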


Here's what I have done so far:

  1. qconf -mconf

     max_u_jobs 10

  2. Added a rule in RQS

     {
       name         max_slots_per_user
       description  Limit no. of active slots per user
       enabled      TRUE
       limit        users {*} to slots=64
     }


Problems I have run into:

  1. With #1 (max_u_jobs 10), SGE won't even accept
     the 11th job a user submits to the queue

  2. I'm a bit confused by the terms 'slots' and
     'jobs' (BioTeam slides treat slots as jobs).
     To get some clarification, I set max_u_jobs
     to 0 (unlimited) and modified the RQS rule
     to limit the maximum slots to 5.

     I then submitted 10 instances of 'Hello, World!'
     (MPI) using 2 processors each and 10 instances
     of 'sleeper.sh' using 1 processor each.

     What I noticed is that the number of slots from
     active jobs summed up to 5, irrespective of the
     number of active jobs.

     So, I do understand now that the term 'jobs' in
     the BioTeam slides actually means 'slots'.
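
The behaviour I saw in that test can be mimicked with a
small Python sketch. It is a toy model under my own
assumptions, not SGE's scheduler: the dispatch function
and the job names are hypothetical, jobs are considered
in the order listed, and a job starts only if the slot
total stays within the limit.

```python
# Toy model of a per-user slot limit; not Grid Engine's scheduler.
def dispatch(pending, slot_limit):
    """Start pending jobs in order while the slot total fits the limit.

    pending is a list of (name, slots) tuples; returns the jobs that run.
    """
    running, used = [], 0
    for name, slots in pending:
        if used + slots <= slot_limit:
            running.append((name, slots))
            used += slots
    return running


# 10 two-slot MPI jobs followed by 10 one-slot sleeper jobs.
pending = [(f"hello_mpi_{i}", 2) for i in range(10)]
pending += [(f"sleeper_{i}", 1) for i in range(10)]

running = dispatch(pending, slot_limit=5)
print(len(running), sum(slots for _, slots in running))
```

With this ordering the model runs 3 jobs using 5 slots;
listing the sleeper jobs first would run 5 jobs, again
using 5 slots. The limit counts slots, not jobs.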


Help please? Any guidance towards further material
to read up on (man pages, websites, etc.) will be
greatly appreciated.

Best,
g

--
Gowtham
Information Technology Services
Michigan Technological University

(906) 487/3593
http://www.it.mtu.edu/
