I'd like to revive this thread a second time! The full thread is
available at
https://groups.google.com/forum/#!msg/slurm-devel/oDoHPoAbiPQ/q9pQL2Uw3y0J
We're in the process of upgrading Slurm from 16.05 to 17.02. I'd like
to be certain that our MPI libraries don't require a specific library
version such as libslurm.so.30. See the thread's example:
  $ readelf -d libmca_common_pmi.so
  0x0000000000000001 (NEEDED) Shared library: [libslurm.so.27]
Question: Can anyone suggest which OpenMPI libraries I have to go
through with readelf in order to make sure we don't have the
libslurm.so.xx problem?
The libmca_common_pmi.so file doesn't exist on our systems. We have
OpenMPI 1.10.3 and 2.0.2 installed with EasyBuild.
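One way to avoid guessing which files matter would be to check every ELF file under each OpenMPI install prefix instead of a hand-picked list. The following is a minimal sketch in Python, not a tested tool: it assumes readelf is on the PATH, and the helper names (needed_libs, versioned_slurm_deps, scan_tree) and the command-line argument are made up for illustration. It reports any file whose dynamic section has a NEEDED entry of the form libslurm.so.NN.

```python
import os
import re
import subprocess
import sys

# Matches `readelf -d` lines such as:
#  0x0000000000000001 (NEEDED) Shared library: [libslurm.so.27]
NEEDED_RE = re.compile(r"\(NEEDED\)\s+Shared library:\s+\[([^\]]+)\]")


def needed_libs(readelf_output):
    """Return the NEEDED library names found in `readelf -d` output."""
    return NEEDED_RE.findall(readelf_output)


def versioned_slurm_deps(readelf_output):
    """Return only the NEEDED entries that name a versioned libslurm."""
    return [lib for lib in needed_libs(readelf_output)
            if re.match(r"libslurm\.so\.\d+", lib)]


def is_elf(path):
    """True if the file starts with the ELF magic bytes."""
    try:
        with open(path, "rb") as fh:
            return fh.read(4) == b"\x7fELF"
    except OSError:
        return False


def scan_tree(prefix):
    """Yield (path, deps) for each ELF file under prefix that needs libslurm.so.NN."""
    for root, _dirs, files in os.walk(prefix):
        for name in sorted(files):
            path = os.path.join(root, name)
            # Skip symlinks (their targets are visited anyway) and non-ELF files.
            if os.path.islink(path) or not is_elf(path):
                continue
            result = subprocess.run(["readelf", "-d", path],
                                    capture_output=True, text=True)
            deps = versioned_slurm_deps(result.stdout)
            if deps:
                yield path, deps


# The install prefix is passed as the first argument (hypothetical usage).
if __name__ == "__main__" and len(sys.argv) > 1:
    found = False
    for path, deps in scan_tree(sys.argv[1]):
        found = True
        print(f"{path}: {', '.join(deps)}")
    sys.exit(1 if found else 0)
```

Run once per OpenMPI installation, with the EasyBuild install directory of that version as the argument. No output and exit status 0 would suggest that nothing under that prefix links directly against a versioned libslurm.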
Our builds of OpenMPI were done on top of a Slurm 16.05 base, and our
build hosts do **not** have the lib64/libpmi2.la and lib64/libpmi.la
which cause problems. According to the above thread, these files were
removed from the slurm-devel RPM package starting from Slurm 16.05. So
I hope that we're good...
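As far as I understand the problem, libtool reads the dependency_libs line of an installed .la file and adds those libraries to the link line, which is how libslurm can end up as a direct dependency of the MPI libraries. A small sketch for double-checking a build host follows. The function names and the library directory passed on the command line are assumptions for illustration, and the script only reads files; it does not change anything.

```python
import glob
import os
import re
import sys

# libtool archives are plain text and carry a line such as:
#   dependency_libs=' -L/usr/lib64 -lslurm -ldl'
DEPLIBS_RE = re.compile(r"^dependency_libs='([^']*)'", re.MULTILINE)


def la_dependency_libs(la_text):
    """Return the entries of the dependency_libs line of a .la file."""
    match = DEPLIBS_RE.search(la_text)
    return match.group(1).split() if match else []


def slurm_entries(la_text):
    """Return the dependency_libs entries that refer to libslurm."""
    return [dep for dep in la_dependency_libs(la_text)
            if dep == "-lslurm"
            or os.path.basename(dep).startswith("libslurm.")]


def check_libdir(libdir):
    """Yield (path, libslurm entries) for each libpmi*.la found in libdir."""
    for path in sorted(glob.glob(os.path.join(libdir, "libpmi*.la"))):
        with open(path, encoding="utf-8", errors="replace") as fh:
            yield path, slurm_entries(fh.read())


# The library directory (e.g. Slurm's lib64) is passed as the first argument.
if __name__ == "__main__" and len(sys.argv) > 1:
    for path, deps in check_libdir(sys.argv[1]):
        print(f"{path}: {' '.join(deps) or 'no libslurm entry'}")
```

If the script prints nothing for the Slurm lib64 directory on a build host, no libpmi.la or libpmi2.la is present there, which matches what we see on our hosts.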
I expect the consequences of having an undetected libslurm.so.xx problem
would be that all MPI jobs would start crashing :-(
Thanks for your help,
Ole
On 02/04/2016 11:26 PM, Kilian Cavalotti wrote:
Hi all,
I would like to revive this old thread, as we've been bitten by this
also when moving from 14.11 to 15.08.
On Mon, Oct 5, 2015 at 4:38 AM, Bjørn-Helge Mevik wrote:
We have verified that we can compile openmpi (1.8.6) against slurm
14.03.7 (with the .la files removed), and then upgrade slurm to 15.08.0
without having to recompile openmpi.
My understanding of linking and libraries is not very thorough,
unfortunately, but according to
https://lists.fedoraproject.org/pipermail/mingw/2012-January/004421.html
the .la files are only needed in order to link against static libraries,
and since Slurm doesn't provide any static libraries, I guess it would
be safe for the slurm-devel rpm not to include these files.
I think the link above describes the situation pretty well. Could we
please remove the .la files from the slurm-devel RPM if they don't
serve any specific purpose?
The attached patch to slurm.spec worked for me.