Hi,

Am 27.04.2011 um 00:04 schrieb William Deegan:

> Is there a way to set the maximum rate at which new jobs will be launched on 
> a cluster (and/or a given machine)?
> 
> My client's worried about submitting a 100 jobs and having them all start at 
> the same time crushing the fileserver.

To throttle the overall job scheduling you will have to define two things:

$ qconf -ssconf
...
job_load_adjustments              np_load_avg=1
load_adjustment_decay_time        0:7:30


and in the queue configuration:

$ qconf -sq all.q
...
load_thresholds       np_load_avg=0.75


Each started job will put amn artificial load on each machine and avoid that 
too many jobs can start at the same time. For the waiting job you will see an 
output like:

$ qstat -j 416
...
scheduling info:            queue instance "all.q@fubar1" dropped because it is 
overloaded: np_load_avg=1.000000 (= 0.010000 + 1 * 0.990000 with nproc=1) >= 
0.75
                            queue instance "all.q@fubar2" dropped because it is 
overloaded: np_load_avg=1.020000 (= 0.030000 + 1 * 0.990000 with nproc=1) >= 
0.75


until the artificial load decreased again. This virtual alarm state of a queue 
won't show up in `qstat -f` though. The exact values of "job_load_adjustments", 
"load_adjustment_decay_time" and "load_thresholds" need to be adjusted by 
experiment to your needs.

-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to