I suspect the problem is that the rsh/ssh launcher is attempting to use a tree pattern for launching the apps - i.e., mpirun launches a daemon on the first couple of nodes, and then those daemons launch daemons on the next level. If rsh/ssh isn't supported on those backend nodes, then this won't work.
Try running it with "-mca plm_rsh_no_tree_spawn 1" on your cmd line. This will instruct OMPi to not use a tree pattern, but to have mpirun directly launch the daemons itself. On Jan 23, 2013, at 5:41 AM, Ada Mancuso <mancuso....@gmail.com> wrote: > Yes I can but with at most two machines as slave and one machine as master, > If I try to add another one as slave I get those errors. > > Il giorno 23/gen/2013 14:38, "Jeff Squyres (jsquyres)" <jsquy...@cisco.com> > ha scritto: > I'm not sure I understand you. Does Open MPI work across multiple machines? > I.e., can you do all three of those steps across multiple machines? > > On Jan 23, 2013, at 8:16 AM, Ada Mancuso <mancuso....@gmail.com> > wrote: > > > I'm sure that openmpi works, morever my problem happens only with more than > > 2 slaves (on different machines while in local it greatly works with any > > number of slaves). > > Thanks > > Ada > > > > Il giorno 23/gen/2013 14:04, "Jeff Squyres (jsquyres)" <jsquy...@cisco.com> > > ha scritto: > > Are you able to run the C examples in the examples/ directory from the > > tarball? > > > > Our README suggests the following: > > > > ----- > > When verifying a new Open MPI installation, we recommend running three > > tests: > > > > 1. Use "mpirun" to launch a non-MPI program (e.g., hostname or uptime) > > across multiple nodes. > > > > 2. Use "mpirun" to launch a trivial MPI program that does no MPI > > communication (e.g., the hello_c program in the examples/ directory > > in the Open MPI distribution). > > > > 3. Use "mpirun" to launch a trivial MPI program that sends and > > receives a few MPI messages (e.g., the ring_c program in the > > examples/ directory in the Open MPI distribution). > > > > If you can run all three of these tests successfully, that is a good > > indication that Open MPI built and installed properly. > > ----- > > > > > > On Jan 23, 2013, at 7:41 AM, Ada Mancuso <mancuso....@gmail.com> > > wrote: > > > > > Hi, > > > I've installed the latest snapshot taken from svn developer's trunk but I > > > had the same problems. This is my configuration: > > > • Ubuntu 2.6.38-8 kernel > > > • Openssh_5.8p1 openssl 0.9.8o > > > • Libtool version 2.4 > > > • Open mpi 1.7 rc5 and latest snapshots. > > > Do you think my problem could be related with the operating system used > > > or with any parameter or configuration? I've also checked the ssh log > > > file but I didn't find any problem. > > > Thanks in advance > > > Ada > > > > > > > > > > > > Il giorno martedì 22 gennaio 2013, Ralph Castain ha scritto: > > > > > > > > Ouch - no, you'd have to take it from the developer's trunk, either via > > > > svn checkout or the nightly developer's snapshot > > > > > > > > On Jan 22, 2013, at 12:35 PM, Ada Mancuso <mancuso....@gmail.com> wrote: > > > > > > > > My problem is that I have to use openmpi 1.7 rc5 because I'm using the > > > > Java binding mpijava... Is it present in the latest snapshot you told > > > > me? If so where can I find it? > > > > Thanks a lot > > > > Ada > > > > > > > > Il giorno 22/gen/2013 21:03, "Ralph Castain" <r...@open-mpi.org> ha > > > > scritto: > > > >> > > > >> It seems to be working fine for me with the latest 1.7 tarball (not > > > >> rc5 - I didn't test that one). Could be there was a problem that has > > > >> since been fixed. We are getting ready to release an updated rc, so > > > >> you might want to try it (or use the latest nightly 1.7 snapshot). > > > >> > > > >> > > > >> On Jan 22, 2013, at 9:57 AM, Ada Mancuso <mancuso....@gmail.com> wrote: > > > >> > > > >> Hi, > > > >> I'm trying to run my mpi program using open mpi 1.7 rc5 on 4 machines > > > >> using the command: > > > >> mpirun -np4 -hostfile file a.out > > > >> but i get the following message errors: > > > >> ORTE_ERROR_LOG: A message is attempting to be sent to a process whose > > > >> contact information is unknown in file > > > >> ../../../../../ompi/orte/mca/rml/oob/rml_oob_send.c > > > >> attempted to send to [[21341,0],2]: tag 15 > > > >> ORTE_ERROR_LOG: A message is attempting to be sent to a process whose > > > >> contact information is unknown in file > > > >> ../../../../ompi/orte/mca/grpcomm/base/grpcomm_base_xcast.c > > > >> The file etc/hosts is composed by ipaddress hostname, I have exchange > > > >> ssh keys among the machines and ssh login works without requiring > > > >> authentication password. Surprisingly if I try to run my program with > > > >> at most 2 hosts, and so the file hosts contains only two hosts, it > > > >> works but if i try to run my program with more than two hosts i have > > > >> this error; mpi works well on each machine and I also tried to run my > > > >> program with different couple of machines in order to be sure that no > > > >> machine could be the problem. > > > >> Can you help me please? > > > >> Ada > > > >> _______________________________________________ > > > >> users mailing list > > > >> us...@open-mpi.org > > > >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > >> > > > >> > > > >> > > > >> _______________________________________________ > > > >> users mailing list > > > >> us...@open-mpi.org > > > >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > > > > _______________________________________________ > > > > users mailing list > > > > us...@open-mpi.org > > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > > > > > > > _______________________________________________ > > > users mailing list > > > us...@open-mpi.org > > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > > > > -- > > Jeff Squyres > > jsquy...@cisco.com > > For corporate legal information go to: > > http://www.cisco.com/web/about/doing_business/legal/cri/ > > > > > > _______________________________________________ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > > users mailing list > > us...@open-mpi.org > > http://www.open-mpi.org/mailman/listinfo.cgi/users > > > -- > Jeff Squyres > jsquy...@cisco.com > For corporate legal information go to: > http://www.cisco.com/web/about/doing_business/legal/cri/ > > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users