Am 11.10.2012 um 12:21 schrieb Xavier:

> I did it with qmon:
> Year: NONE
> Week: mon -wed=1-9=suspended thu=1-11=suspended fri-sun=1-9=suspended
> i have called it "short".

Thx - I was wondering whether you put "off" there which would only disallow new 
jobs.


> Then still with qmon i have modified short.q by selected "short" as a 
> calendar.
> 
> 
> Since i had change the time this morning qstat says:
> 
> 28865 0.58449 JOB1 xavier       S     10/03/2012 09:44:18 
> [email protected]         32

According to S the job is suspended. Does `qstat -f` show state C for the queue 
(calendar suspended)?

Did you check with:

$ ps -e f

on the n ode that all processes are kids of the sge_shepherd? They should have 
gotten state "T" then in `ps`.

-- Reuti


> but again while compute-0-2 is having a load of 8 (8cpus/nodes) compute-0-1 
> and others are overloading at 16...
> 
> using SGE 6.2u4 on a ROCKS 5.3 cluster
> 
> On 11/10/2012 11:09, Reuti wrote:
>> Am 11.10.2012 um 11:56 schrieb Xavier:
>> 
>>> Hi all,
>>> 
>>> I have created a calendar queue only available during the day (6am to 1am) 
>>> keeping nodes free for the night jobs trough an other queue.
>>> This queue is composed of 4 nodes (32cpus). All jobs used the 32cpus
>> Good - but what calendar definition did you create in detail?
>> 
>> -- Reuti
>> 
>> 
>>> what i don't get is that one of the nodes AND ONLY ONE drop its load 0 at 
>>> 1am. this node is the one where the job attributed, i.e.
>>> 
>>> from qstat
>>> JOB1 xavier       r     10/03/2012 09:44:18 [email protected]     32
>>> 
>>> while other 3 nodes  keep their load and therefore overload at night.
>>> 
>>> example of last day load
>>> for compute-0-2
>>> http://nautilus.ciimar.up.pt/ganglia/graph.php?g=load_report&z=large&c=nautilus&h=compute-0-2.local&m=load_one&r=day&s=descending&hc=4&mc=2&st=1349865411
>>> and for compute-0-1
>>> http://nautilus.ciimar.up.pt/ganglia/graph.php?g=load_report&z=large&c=nautilus&h=compute-0-1.local&m=load_one&r=day&s=descending&hc=4&mc=2&st=1349865467
>>> 
>>> Why does all nodes not behave the same ?
>>> 
>>> Xavier
>>> 
>>> -- 
>>> Universidade da Madeira
>>> CCM - Centro de Ciencias Matematicas
>>> Campus Universitario da Penteada
>>> 9000-390 Funchal, Madeira Island
>>> Portugal
>>> 
>>> (+351) 291 705 186
>>> http://wakes.uma.pt
>>> 
>>> _______________________________________________
>>> users mailing list
>>> [email protected]
>>> https://gridengine.org/mailman/listinfo/users
> 
> -- 
> Universidade da Madeira
> CCM - Centro de Ciencias Matematicas
> Campus Universitario da Penteada
> 9000-390 Funchal, Madeira Island
> Portugal
> 
> (+351) 291 705 186
> http://wakes.uma.pt
> 


_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to