Hi,

i've been trying to test the MPI_Comm_spawn interface with slurm and 
pm=srun, as opposed to the hydra process manager provided by mpich2.

i've rebuilt my mpich2 version with these flags:
./configure --with-pmi=slurm --with-pm=no --enable-shared

whatever i've tried so far, i can't get it to spawn new commands for me:

int err = MPI_Comm_spawn(cmd, &char_argv[0], 1, NULL, 0, mpi_mgr.comm(), 
&intracomm, MPI_ERRCODES_IGNORE);

results in:

Fatal error in MPI_Comm_spawn: Other MPI error, error stack:
MPI_Comm_spawn(144)...........: MPI_Comm_spawn(cmd="spawn_tst", 
argv=0x7faea00276b0, maxprocs=1, info=0x9c000000, root=0, 
MPI_COMM_WORLD, intercomm=0x7fff96511850, errors=(nil)) failed
MPIDI_Comm_spawn_multiple(240): PMI_Spawn_multiple returned -1

all the code works fine via hydra. i'm curious whether people are using the 
srun pm with MPI_Comm_spawn successfully, or whether there are caveats/known 
issues i need to look out for. i can't even get the basic examples to 
work, so i am sure i must be doing something wrong.

i'm using slurm-2.5.4 and mpich2-1.5rc1.

any help would be highly appreciated.

for reference, this is how i launch the job:

srun -N 2 -B '*:*:*' --exclusive mycmd args


Kind Regards,
Christoph
