Am 04.04.2012 um 14:28 schrieb Esztermann, Ansgar:
> On Mar 28, 2012, at 17:31 , Reuti wrote:
>
>> Hi,
>>
>> Am 27.03.2012 um 15:42 schrieb Esztermann, Ansgar:
>>
>>> Hi everyone,
>>>
>>> while in general, all users are equal in our installation, I would like
>>> some nodes to have a longer maximum runtime for some users. In order to
>>> avoid oversubscription, we have only one queue per node. So instead of
>>> putting both a "medium" and a "long" queue on these nodes, I set up an RQS
>>> like this:
>>> {
>>> name lfn
>>> description Long Fat Nodes
>>> enabled TRUE
>>> limit users {aeszter,b,c} hosts {@lfn} to h_rt=2764800
>>> limit users {*} hosts {@lfn} to h_rt=604800
>>> }
>>>
>>> This works as expected for serial jobs, but parallel ones (even with one
>>> slot) refuse to start:
>>> #qalter -w p 1071136
>>> Job 1071136 cannot run because it exceeds limit "aeszter////node12-34/" in
>>> rule "lfn/1"
>>> #qstat -j 1071136
>>
>> just a follow up:
>>
>> Can you please try to add h_rt with an arbitrary high value to each exechost
>> (`qconf -me ...`)? Is it working then?
>
> No, it is not.
Thx for letting me know. I tries something else and suddenly it was working.
But maybe it was just coincidence with the underlying cause.
-- Reuti
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users