Hello,

Since today and I really don't know why when i request 16 slots in my cluster i'm getting job queued in qw status forever.
I've 2 compute nodes each one has 8 cpus (8 slots).

When I request 8 sloths with '-pe mpi 8' or '-pe orte 8', it works perfect and allocates task in compute-0-0. When that job is running if i request 8 more slots with another job, it never gets queued to compute-0-1 slots. Job isn't showing any error or even typical "queue instances are full" message.
If i request 16 slots at one it also gets queued forever.

Seems like compute-0-1 slots aren't provided or availabe.


Best regards,
Guillermo.
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to