Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-05-06 Thread Ralph Castain via users
The following (from what you posted earlier): $ srun --mpi=list srun: MPI types are... srun: none srun: pmix_v3 srun: pmi2 srun: openmpi srun: pmix would indicate that Slurm was built against a PMIx v3.x release. Using OMPI v4.0.3 with pmix=internal should be just fine so long as you set --mpi=p

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-05-06 Thread Prentice Bisbal via users
No, and I fear that may be the problem. When we built OpenMPI, we did --with-pmix=internal. Not sure how Slurm was built, since my coworker built it. Prentice On 4/28/20 2:07 AM, Daniel Letai via users wrote: I know it's not supposed to matter, but have you tried building both ompi and sl

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-27 Thread Prentice Bisbal via users
l Message- From: users [mailto:users-boun...@lists.open-mpi.org] On Behalf Of Prentice Bisbal via users Sent: Friday, April 24, 2020 2:19 PM To: Ralph Castain mailto:r...@open-mpi.org>>; Open MPI Users mailto:users@lists.open-mpi.org>> Cc: Prentice Bisbal mailto:pbis...@pppl.gov>&g

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-24 Thread Prentice Bisbal via users
Okay. I've got Slurm built with pmix support: $ srun --mpi=list srun: MPI types are... srun: none srun: pmix_v3 srun: pmi2 srun: openmpi srun: pmix But now when I try to launch a job with srun, the job appears to be running, but doesn't do anything - it just hangs in the running state but does

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-23 Thread Ralph Castain via users
You can trust the --mpi=list. The problem is likely that OMPI wasn't configured --with-pmi2 > On Apr 23, 2020, at 11:59 AM, Prentice Bisbal via users > wrote: > > --mpi=list shows pmi2 and openmpi as valid values, but if I set --mpi= to > either of them, my job still fails. Why is that? Can

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-23 Thread Prentice Bisbal via users
--mpi=list shows pmi2 and openmpi as valid values, but if I set --mpi= to either of them, my job still fails. Why is that? Can I not trust the output of --mpi=list? Prentice On 4/23/20 10:43 AM, Ralph Castain via users wrote: No, but you do have to explicitly build OMPI with non-PMIx support

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-23 Thread Ralph Castain via users
No, but you do have to explicitly build OMPI with non-PMIx support if that is what you are going to use. In this case, you need to configure OMPI --with-pmi2= You can leave off the path if Slurm (i.e., just "--with-pmi2") was installed in a standard location as we should find it there. > On A

Re: [OMPI users] [External] Re: Can't start jobs with srun.

2020-04-23 Thread Prentice Bisbal via users
It looks like it was built with PMI2, but not PMIx: $ srun --mpi=list srun: MPI types are... srun: none srun: pmi2 srun: openmpi I did launch the job with srun --mpi=pmi2 Does OpenMPI 4 need PMIx specifically? On 4/23/20 10:23 AM, Ralph Castain via users wrote: Is Slurm built with PMIx