Received from Andy Riebs on Tue, Sep 23, 2014 at 03:57:49PM EDT: > On 9/23/2014 3:07 PM, Lev Givon wrote: > > Received from Andy Riebs on Tue, Sep 23, 2014 at 02:57:49PM EDT:
(snip) > > > Lev, if you drop "mpiexec" from your command line, you should see > > > the desired behaviour, i.e., > > > > > > $ srun -n X program > > > > Doing so does launch the program only X times, but the communicator size > > seen by each > > instance is 1, e.g., for the proverbial "Hello world" program, the output > > > > Hello, world, I am 0 of 1 (myhost) > > > > is generated X times. > > > > Incidentally, I verified that OpenMPI was build against PMI successfully: > > > > $ ldd /opt/openmpi-1.8.2/bin/mpiexec | grep pmi > > libpmi.so.0 => /usr/lib/libpmi.so.0 (0x00002aed18f66000) > > Ahhh... try adding "--mpi=pmi" or "--mpi=pmi2" to your srun command. > > Andy That did the trick - thanks! > p.s. If this fixes it, you might want to set the mpi default in > slurm.conf appropriately. -- Lev Givon Bionet Group | Neurokernel Project http://www.columbia.edu/~lev/ http://lebedov.github.io/ http://neurokernel.github.io/
