Hello,
After clearing the E (error) state on all.q@compute-0-1,
I'm now getting this error when trying to use 16 slots with MPI:
error: executing task of job 127 failed: execution daemon on host
"compute-0-1.local" didn't accept task
On 13/11/2012 10:01, Guillermo Marco Puche wrote:
Hello,
Since today, and I really don't know why, when I request 16 slots in my
cluster the job stays queued in qw status forever.
I have 2 compute nodes; each one has 8 CPUs (8 slots).
When I request 8 slots with '-pe mpi 8' or '-pe orte 8', it works
perfectly and allocates the tasks on compute-0-0.
While that job is running, if I request 8 more slots with another job,
it is never dispatched to the compute-0-1 slots.
The job isn't showing any error, not even the typical "queue instances are full"
message.
If I request 16 slots at once, it also stays queued forever.
It seems like the compute-0-1 slots aren't being offered or aren't available.
Best regards,
Guillermo.
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users