Title: Re: [slurm-dev] Re: Change in srun ?
|
Did you rebuild mpi with flag "--with-pmi" pointing at slurm's
include dir, after making sure slurm has pmi.h there? I usually
build with both --with-pmi and --with-slurm, although the last one
should be enabled by default.
Regards,
--Dani_L.
On 07/19/2017 10:38 AM, Gilles
Gouaillardet wrote:
Diego,
this is probably due to a change in the PMI interface.
i suggest you rebuilt your MPI library first, and then try again
Cheers,
Gilles
On 7/19/2017 4:20 PM, Diego Zuccato wrote:
Hello all.
I've just upgraded from Debian 8 to Debian 9, and that upgraded
slurm
from 14.03 to 16.05.
But now some MPI jobs are no more really parallel if run via
srun, but
work OK if run via mpirun.
The problem seems that srun launches N threads with
mpi_world_size=1 (so
every process thinks it's the only one and all threads work on
the same
dataset), while mpirun launches one thread with mpi_world_size=N
.
Did I miss something?
Tks.
|
smime.p7s
Description: S/MIME Cryptographic Signature