On Aug 11, 2009, at 5:17 AM, Ashley Pittman wrote:

On Tue, 2009-08-11 at 03:03 -0600, Ralph Castain wrote:
If it isn't already there, try putting a print statement right at
program start, another just prior to MPI_Init, and another just after
MPI_Init. It could be that something is hanging somewhere during
program startup since it sounds like everything is launching just
fine.

If you suspect a hang, you can run the command orte-ps (on the node
where mpirun is running) and it should show you your job. This will
tell you whether the job has started and is still running, or whether
there was a problem launching.

If the program did start and has really hung, then you can get more
in-depth information about it using padb, which is linked to in my
signature.

FWIW: we use padb for this purpose, and it is very helpful!

Ralph


Ashley,

--

Ashley Pittman, Bath, UK.

Padb - A parallel job inspection tool for cluster computing
http://padb.pittman.org.uk

_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
