Hi all,

we are facing problems while executing mpi programs.if all the client 
nodes are up, then the mpi application works fine. But
in case if any one of the node is down then the execution terminates.

Here is one sample output:(specifically oscarnode2 is dead)

# mpirun -np 4 ./a.out
oscarserver.oscardomain
Sun Mar 27 23:13:15 IST 2005
oscarnode1.oscardomain
Sun Mar 27 23:13:25 IST 2005
ssh: connect to host oscarnode2 port 22: No route to host
p0_14474:  p4_error: Child process exited while making connection to 
remote process on oscarnode2: 0
Killed by signal 2.

/opt/mpich-1.2.5.10-ch_p4-gcc/bin/mpirun: line 1: 14474 Broken pipe   
           /home/oscartst/suresh/./a.out -p4pg 
/home/oscartst/suresh/PI14350 -p4wd /home/oscartst/suresh

But it should not affect the performance of the cluster even if one 
of the node is dead right?

Eagerly waiting for the response.

ullas



-- 
______________________________________________
Check out the latest SMS services @ http://www.linuxmail.org 
This allows you to send and receive SMS through your mailbox.


Powered by Outblaze


-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_ide95&alloc_id396&op=click
_______________________________________________
Oscar-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/oscar-users

Reply via email to