Re: [OMPI users] Q: Problems launching MPMD applications? ('mca_oob_tcp_peer_try_connect' error 103)

2007-12-06 Thread Ralph H Castain
On 12/5/07 8:47 AM, "Brian Dobbins" wrote: > Hi Josh, > >> I believe the problem is that you are only applying the MCA >> parameters to the first app context instead of all of them: > > Thank you very much.. applying the parameters with -gmca works fine with the > test case (and I'll try t

Re: [OMPI users] Q: Problems launching MPMD applications? ('mca_oob_tcp_peer_try_connect' error 103)

2007-12-05 Thread Brian Dobbins
Hi Josh, I believe the problem is that you are only applying the MCA > parameters to the first app context instead of all of them: Thank you *very* much.. applying the parameters with -gmca works fine with the test case (and I'll try the actual one soon). However and this is minor since it w

Re: [OMPI users] Q: Problems launching MPMD applications? ('mca_oob_tcp_peer_try_connect' error 103)

2007-12-05 Thread Josh Hursey
I believe the problem is that you are only applying the MCA parameters to the first app context instead of all of them: shell$ mpiexec -v -d -machinefile $PBS_NODEFILE -mca oob_tcp_if_exclude eth0 -mca pls_rsh_agent ssh -np 6 ./hwc.exe : -np 2 ./hwc.exe The '-mca' parameter will apply

Re: [OMPI users] Q: Problems launching MPMD applications? ('mca_oob_tcp_peer_try_connect' error 103)

2007-12-05 Thread Brian Dobbins
As a quick follow-up to my own post, I just tried this on a few other systems: 1) One system, on which the nodes have only one ethernet device, running the code with the split "-np" arguments works fine. 2) Another system, which has IB links (as default), runs the code fine. 3) Two very similar

[OMPI users] Q: Problems launching MPMD applications? ('mca_oob_tcp_peer_try_connect' error 103)

2007-12-05 Thread Brian Dobbins
Hi guys, I seem to have encountered an error while trying to run an MPMD executable through Open MPI's '-app' option, and I'm wondering if anyone else has seen this or can verify this? Backing up to a simple example, running a "hello world" executable (hwc.exe) works fine when run as: (using a