Received from Andy Riebs on Tue, Sep 23, 2014 at 03:57:49PM EDT:
> On 9/23/2014 3:07 PM, Lev Givon wrote:
> > Received from Andy Riebs on Tue, Sep 23, 2014 at 02:57:49PM EDT:

(snip)

> > > Lev, if you drop "mpiexec" from your command line, you should see
> > > the desired behaviour, i.e.,
> > >
> > > $ srun -n X program
> >
> > Doing so does launch the program only X times, but the communicator size 
> > seen by each
> > instance is 1, e.g., for the proverbial "Hello world" program, the output
> >
> > Hello, world, I am 0 of 1 (myhost)
> >
> > is generated X times.
> >
> > Incidentally, I verified that OpenMPI was build against PMI successfully:
> >
> > $ ldd /opt/openmpi-1.8.2/bin/mpiexec  | grep pmi
> >         libpmi.so.0 => /usr/lib/libpmi.so.0 (0x00002aed18f66000)
>
> Ahhh... try adding "--mpi=pmi" or "--mpi=pmi2" to your srun command.
> 
> Andy

That did the trick - thanks!

> p.s. If this fixes it, you might want to set the mpi default in
> slurm.conf appropriately.
-- 
Lev Givon
Bionet Group | Neurokernel Project
http://www.columbia.edu/~lev/
http://lebedov.github.io/
http://neurokernel.github.io/

Reply via email to