In our experience, OpenMPI must be rebuilt for each major Slurm
release, such as the transition from Slurm 14.03.x to 14.11.x.
Andy
On 04/16/2015 07:49 AM, Ralph Castain wrote:
Re: [slurm-dev] Re: Need for recompiling openmpi built with --with-pmi?
To be clear, we aren't linking to libslurm at all.
The issue is that libpmi links to it, and we link to
libpmi. So I think you have to recompile to get the link
dependencies set up correctly.
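
The chain described here (the OpenMPI PMI component links to libpmi, which in turn links to libslurm) can be checked after a Slurm upgrade by asking the dynamic linker which dependencies it fails to resolve. The following is a minimal sketch, assuming a Linux system where `ldd` is available. The function names `parse_missing` and `missing_dependencies` are illustrative only and are not part of OpenMPI or Slurm.

```python
import subprocess


def parse_missing(ldd_output: str) -> list:
    """Return the library names that ldd reports as 'not found'."""
    missing = []
    for line in ldd_output.splitlines():
        # Unresolved entries look like: "libslurm.so.27 => not found"
        if "=> not found" in line:
            missing.append(line.split("=>")[0].strip())
    return missing


def missing_dependencies(shared_object: str) -> list:
    """Run ldd on a shared object and list its unresolved dependencies.

    ldd also reports indirect dependencies, so a library needed only
    through libpmi shows up here as well.
    """
    result = subprocess.run(
        ["ldd", shared_object],
        capture_output=True,
        text=True,
        check=False,
    )
    return parse_missing(result.stdout)
```

Calling `missing_dependencies()` with the path of the installed PMI component (for example a hypothetical `/path/to/lib/openmpi/mca_ess_pmi.so`) returns an empty list when every dependency resolves. A non-empty result after a Slurm upgrade suggests that a rebuild is needed.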
On Thu, Apr 16, 2015 at 5:32 AM, Uwe Sauter <[email protected]> wrote:
Hi Ralph,
besides the mentioned libslurm.so.28 there is also a
libslurm.so pointing to the same libslurm.so.28.0.0 file.
Perhaps OpenMPI could use this link instead of the versioned one?
File list in slurm/lib directory:
-rw-r--r-- 1 slurm slurm   68992 Mar 20 11:39 libpmi.a
-rwxr-xr-x 1 slurm slurm    1016 Mar 20 11:39 libpmi.la
lrwxrwxrwx 1 slurm slurm      15 Mar 20 11:39 libpmi.so -> libpmi.so.0.0.0
lrwxrwxrwx 1 slurm slurm      15 Mar 20 11:39 libpmi.so.0 -> libpmi.so.0.0.0
-rwxr-xr-x 1 slurm slurm   52800 Mar 20 11:39 libpmi.so.0.0.0
-rw-r--r-- 1 slurm slurm 8099794 Mar 20 11:39 libslurm.a
-rw-r--r-- 1 slurm slurm 8348210 Mar 20 11:39 libslurmdb.a
-rwxr-xr-x 1 slurm slurm    1006 Mar 20 11:39 libslurmdb.la
lrwxrwxrwx 1 slurm slurm      20 Mar 20 11:39 libslurmdb.so -> libslurmdb.so.28.0.0
lrwxrwxrwx 1 slurm slurm      20 Mar 20 11:39 libslurmdb.so.28 -> libslurmdb.so.28.0.0
-rwxr-xr-x 1 slurm slurm 4115144 Mar 20 11:39 libslurmdb.so.28.0.0
-rwxr-xr-x 1 slurm slurm     992 Mar 20 11:39 libslurm.la
lrwxrwxrwx 1 slurm slurm      18 Mar 20 11:39 libslurm.so -> libslurm.so.28.0.0
lrwxrwxrwx 1 slurm slurm      18 Mar 20 11:39 libslurm.so.28 -> libslurm.so.28.0.0
-rwxr-xr-x 1 slurm slurm 4012214 Mar 20 11:39 libslurm.so.28.0.0
drwxr-xr-x 2 slurm slurm    4096 Mar 20 11:40 pam
drwxr-xr-x 3 slurm slurm   12288 Mar 20 11:40 slurm
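
On the question of the unversioned link: on Linux, a binary normally records the versioned name (the soname, here libslurm.so.27) of each library at link time, and the runtime loader then looks for a file with exactly that name. The unversioned libslurm.so link is used by the linker at build time only. The following toy model of that lookup is a sketch under this assumption. The helper `soname_resolves` is hypothetical, and the model ignores search paths and the loader cache.

```python
def soname_resolves(needed: str, files_in_lib_dir: list) -> bool:
    """Toy model: the loader needs a file whose name equals the recorded soname."""
    return needed in files_in_lib_dir


# libslurm entries from the slurm/lib directory after the upgrade to 14.11
lib_dir = ["libslurm.so", "libslurm.so.28", "libslurm.so.28.0.0"]

# A component built against Slurm 14.03 recorded libslurm.so.27
old_build_ok = soname_resolves("libslurm.so.27", lib_dir)

# A component rebuilt against Slurm 14.11 records libslurm.so.28
new_build_ok = soname_resolves("libslurm.so.28", lib_dir)
```

In this model the old build fails to resolve its dependency even though libslurm.so is present, which matches the error reported in this thread.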
Regards,
Uwe
On 16.04.2015 at 13:27, Ralph Castain wrote:
> Hmmm...yeah, it sounds like Slurm changed its library names and/or
> dependencies. I'm afraid that you do indeed need to recompile
> OMPI in that case. You probably need to rerun configure as well,
> just to be safe.
>
> Sorry - outside OMPI's control :-/
>
>
> On Thu, Apr 16, 2015 at 5:22 AM, Uwe Sauter <[email protected]> wrote:
>
>
> Hi,
>
> I have the case that OpenMPI was built against Slurm 14.03
> (which provided libslurm.so.27). Since upgrading to 14.11 I get
> errors like:
>
> [controller:35605] mca: base: component_find: unable to open
> /opt/apps/openmpi/1.8.1/gcc/4.9/0/lib/openmpi/mca_ess_pmi:
> libslurm.so.27: cannot open shared object file: No such file or directory (ignored)
>
> because now Slurm provides libslurm.so.28.
>
> I believe the only way to resolve this is to recompile OpenMPI… correct?
>
>
> Regards,
>
> Uwe
>
>
> On 16.04.2015 at 13:18, Ralph Castain wrote:
> > No, you shouldn't have to do so - it's a dynamic library that
> > gets picked up at execution.
> >
> >
> > On Thu, Apr 16, 2015 at 2:55 AM, Bjørn-Helge Mevik <[email protected]> wrote:
> >
> >
> > We are considering compiling openmpi with "--with-pmi=/opt/slurm" to
> > enable running mpi jobs with srun.
> >
> > If we do this, will we have to recompile openmpi and/or programs built
> > with openmpi when we upgrade slurm? (If so, only for major upgrades, or
> > for minor upgrades as well?)
> >
> > --
> > Regards,
> > Bjørn-Helge Mevik, dr. scient,
> > Department for Research Computing, University of Oslo
> >
> >
>
>