Am 06.02.2012 um 22:44 schrieb Reuti:

> <snip>
> As the parent jobs are not doing any work, a special parent.q would do which 
> has to be requested by a forced boolean complex, so that only parent jobs can 
> get in. You could even set a h_cpu limit on this queue to avoid abuse - jobs 
> abusing this queue would get killed after 5 minutes or so. The overall slot 
> count used in this cluster queue you can limit in an RQS.

I didn't mention it: you oversubscribe by intention. In case you have further 
queues on each host, the RQS for limiting the overall slot count will need two 
entries:

limit name parents queues parent.q hosts {*} to slots=1
limit name slots hosts {*} to slots=$num_proc

The first matching rule will end the RQS. The second RQS for the parent.q 
overall use it like usual:

limit name parents queues parent.q to slots=17

Please adjust the numbers to your needs. You could even run all parents on one 
machine, but I would spread it around.

-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to