But if libpmi.so is provided by Slurm, why do I get the error messages? Does 
OpenMPI statically link libpmi.a which then depends
on an older version of libslurm.so?
If OpenMPI dynamically links agains libpmi.so which itself either links against 
libslurm.so or libslurm.so.28, shouldn't this
result in no direct dependency from OpenMPI to libslurm.so?



Am 16.04.2015 um 13:50 schrieb Ralph Castain:
> To be clear, we aren't linking to libslurm at all. The issue is that libpmi 
> is linking to it, and we link to libpmi. So I think
> you have to recompile to get the link dependencies correctly setup.
> 
> On Thu, Apr 16, 2015 at 5:32 AM, Uwe Sauter <[email protected] 
> <mailto:[email protected]>> wrote:
> 
> 
>     Hi Ralph,
> 
>     beside the mentioned libslurm.so.28 there is also a libslurm.so pointing 
> to the same libslurm.so.28.0.0 file. Perhaps OpenMPI
>     could use this link instead of the versioned on?
> 
>     File list in slurm/lib directory:
> 
>     -rw-r--r-- 1 slurm slurm   68992 Mar 20 11:39 libpmi.a
>     -rwxr-xr-x 1 slurm slurm    1016 Mar 20 11:39 libpmi.la <http://libpmi.la>
>     lrwxrwxrwx 1 slurm slurm      15 Mar 20 11:39 libpmi.so -> libpmi.so.0.0.0
>     lrwxrwxrwx 1 slurm slurm      15 Mar 20 11:39 libpmi.so.0 -> 
> libpmi.so.0.0.0
>     -rwxr-xr-x 1 slurm slurm   52800 Mar 20 11:39 libpmi.so.0.0.0
>     -rw-r--r-- 1 slurm slurm 8099794 Mar 20 11:39 libslurm.a
>     -rw-r--r-- 1 slurm slurm 8348210 Mar 20 11:39 libslurmdb.a
>     -rwxr-xr-x 1 slurm slurm    1006 Mar 20 11:39 libslurmdb.la 
> <http://libslurmdb.la>
>     lrwxrwxrwx 1 slurm slurm      20 Mar 20 11:39 libslurmdb.so -> 
> libslurmdb.so.28.0.0
>     lrwxrwxrwx 1 slurm slurm      20 Mar 20 11:39 libslurmdb.so.28 -> 
> libslurmdb.so.28.0.0
>     -rwxr-xr-x 1 slurm slurm 4115144 Mar 20 11:39 libslurmdb.so.28.0.0
>     -rwxr-xr-x 1 slurm slurm     992 Mar 20 11:39 libslurm.la 
> <http://libslurm.la>
>     lrwxrwxrwx 1 slurm slurm      18 Mar 20 11:39 libslurm.so -> 
> libslurm.so.28.0.0
>     lrwxrwxrwx 1 slurm slurm      18 Mar 20 11:39 libslurm.so.28 -> 
> libslurm.so.28.0.0
>     -rwxr-xr-x 1 slurm slurm 4012214 Mar 20 11:39 libslurm.so.28.0.0
>     drwxr-xr-x 2 slurm slurm    4096 Mar 20 11:40 pam
>     drwxr-xr-x 3 slurm slurm   12288 Mar 20 11:40 slurm
> 
> 
>     Regards,
> 
>             Uwe
> 
> 
>     Am 16.04.2015 um 13:27 schrieb Ralph Castain:
>     > Hmmm...yeah, it sounds like Slurm changed it's library names and/or 
> dependencies. I'm afraid that you do indeed need to recompile
>     > OMPI in that case. You probably need to rerun configure as well, just 
> to be safe.
>     >
>     > Sorry - outside OMPI's control :-/
>     >
>     >
>     > On Thu, Apr 16, 2015 at 5:22 AM, Uwe Sauter <[email protected] 
> <mailto:[email protected]> <mailto:[email protected] 
> <mailto:[email protected]>>> wrote:
>     >
>     >
>     >     Hi,
>     >
>     >     I have the case that OpenMPI was built against Slurm 14.03 (which 
> provided libslurm.so.27). Since upgrading to 14.11 I get errors
>     >     like:
>     >
>     >     [controller:35605] mca: base: component_find: unable to open
>     >     /opt/apps/openmpi/1.8.1/gcc/4.9/0/lib/openmpi/mca_ess_pmi:
>     >     libslurm.so.27: cannot open shared object file: No such file or 
> directory (ignored)
>     >
>     >     because now Slurm provides libslurmdb.so.28 .
>     >
>     >     I believe the only way to resolve this is to recompile OpenMPI… 
> correct?
>     >
>     >
>     >     Regards,
>     >
>     >             Uwe
>     >
>     >
>     >     Am 16.04.2015 um 13:18 schrieb Ralph Castain:
>     >     > No, you shouldn't have to do so - it's a dynamic library that 
> gets picked up at execution
>     >     >
>     >     >
>     >     > On Thu, Apr 16, 2015 at 2:55 AM, Bjørn-Helge Mevik 
> <[email protected] <mailto:[email protected]> 
> <mailto:[email protected] <mailto:[email protected]>>
>     >     <mailto:[email protected] <mailto:[email protected]> 
> <mailto:[email protected]
>     <mailto:[email protected]>>>> wrote:
>     >     >
>     >     >
>     >     >     We are considering compiling openmpi with 
> "--with-pmi=/opt/slurm" to
>     >     >     enable running mpi jobs with srun.
>     >     >
>     >     >     If we do this, will we have to recompile openmpi and/or 
> programs built
>     >     >     with openmpi when we upgrade slurm? (If so, only for major 
> upgrades, or
>     >     >     for minor upgrades as well?)
>     >     >
>     >     >     --
>     >     >     Regards,
>     >     >     Bjørn-Helge Mevik, dr. scient,
>     >     >     Department for Research Computing, University of Oslo
>     >     >
>     >     >
>     >
>     >
> 
> 

Reply via email to