Hi Henk,

The sge script should not require any extra parameter. The qdel command should send the kill signal to mpirun and also remove the SGE allocated tmp directory (in something like /tmp/174.1.all.q/) which contains the OMPI session dir for the running job, and in turns would cause orted and the user processes to exit.

Maybe you could try qdel -f <jid> to force delete from the sge_qmaster, in case when sge_execd does not respond to the delete request by the sge_qmaster?

SLIM H.A. wrote:
I am using OpenMPI 1.2.3 with SGE 6.0u7 over InfiniBand (OFED 1.2),
following the recommendation in the OpenMPI FAQ

http://www.open-mpi.org/faq/?category=running#run-n1ge-or-sge

The job runs but when the user wants to delete the job with the qdel
command, this fails. Does the mpirun command

mpirun -np $NSLOTS ./exe

in the sge script require extra parameters?

Thanks for any advice

Henk

_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


--

- Pak Lui
pak....@sun.com

Reply via email to