Jeff Squyres wrote:
On Aug 4, 2008, at 10:02 AM, Jeff Squyres wrote:
I *think* George Bosilca sent some sample code about this across one
of the OMPI lists (users or devel) a long time ago. I'm not 100%
sure about that, though...
I unfortunately forget the trick that he used. :-\
George is unable to send e-mail at the moment, but he asked me to send
these links:
http://www.lam-mpi.org/MailArchives/lam/2003/06/6226.php
http://www.open-mpi.org/community/lists/users/2007/10/4327.php
And a simple example:
http://www.open-mpi.org/community/lists/users/2008/07/6168.php
Hope that helps.
Thanks for all your help.
I'm afraid I still could not make the simple example spawn work on
remote hosts listed in host file. Everything stays local. At least on 1.2.x
But I think I've got a path forward. I've been able to use sockets and
MPI_Comm_join to create intercomms between the singleton and
mpirun-spawned children. The important step I was missing was
"orted --persistent --seed --scope public". Now the MPI_Comm_join calls
in different worlds can see each other. FWIW, this technique just works
in LAM, presumably because lamboot is an explicit step.
I've almost got my head wrapped around the technique in
http://www.open-mpi.org/community/lists/users/2007/10/4327.php
Are there any shortcuts I could take for the case where all the clients
are already in a group?
--
Mark Borgerding
3dB Labs, Inc
Innovate. Develop. Deliver.