In an attempt to allow myself and users to test 14.11.6 before we update
our 14.03.10 installation I've installed SLURM to our apps repository and
created a loadable module to access the test instance of SLURM.  When I
execute sbatch commands I get the following:

$ sbatch mhd-test.slrm
sbatch: error: Couldn't load specified plugin name for select/alps: Plugin
missing a required symbol use debug3 to see
sbatch: error: Couldn't load specified plugin name for select/serial:
Plugin missing a required symbol use debug3 to see
sbatch: error: Couldn't load specified plugin name for select/cons_res:
Plugin missing a required symbol use debug3 to see
sbatch: error: Couldn't load specified plugin name for select/bluegene:
Plugin missing a required symbol use debug3 to see
sbatch: error: Couldn't load specified plugin name for select/linear:
Plugin missing a required symbol use debug3 to see
sbatch: error: Couldn't load specified plugin name for select/cray: Plugin
missing a required symbol use debug3 to see
sbatch: fatal: Can't find plugin for select/cons_res

The test slurmctld has debug3 enabled and prints this:

[2015-04-30T11:23:30.304] debug:  _slurm_recv_timeout at 0 of 4, recv zero
bytes
[2015-04-30T11:23:30.304] error: slurm_receive_msg: Zero Bytes were
transmitted or received
[2015-04-30T11:23:30.314] error: slurm_receive_msg: Zero Bytes were
transmitted or received

The controller + slurmdbd are already on 14.11.6 as are the nodes in this
test cluster.

I built this test install of SLURM from source.  Our production install of
SLURM is via RPM.  I build our RPMs in mock, so the steps I took to build
this version from source were taken from the steps used during rpmbuild in
mock.

CFLAGS='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions
-fstack-protector --param=ssp-buffer-size=4 -m64 -mtune=generic' \
CXXFLAGS='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions
-fstack-protector --param=ssp-buffer-size=4 -m64 -mtune=generic' \
FFLAGS='-O2 -g -pipe -Wall -Wp,-D_FORTIFY_SOURCE=2 -fexceptions
-fstack-protector --param=ssp-buffer-size=4 -m64 -mtune=generic
-I/usr/lib64/gfortran/modules' \
./configure --build=x86_64-redhat-linux-gnu \
--host=x86_64-redhat-linux-gnu \
--target=x86_64-redhat-linux-gnu \
--program-prefix= \
--prefix=/apps/slurm/14.11.6

make && make install

I copied our 14.11.6 test config into /apps/slurm/14.11.6/etc.  These are
environment variables being set by the loaded module:

SLURM_CONF=/apps/slurm/14.11.6/etc/slurm.conf
PATH=/apps/slurm/14.11.6/bin:$PATH
MANPATH=/apps/slurm/14.11.6/share/man:$MANPATH
LD_LIBRARY_PATH=/apps/slurm/14.11.6/lib/slurm:/apps/slurm/14.11.6/lib:$LD_LIBRARY_PATH
LIBRARY_PATH=/apps/slurm/14.11.6/lib/slurm:/apps/slurm/14.11.6/lib:$LIBRARY_PATH

I feel like I'm missing something obvious that results in the plugins
failing to load.  An FAQ entry [1] looks similar but unsure if this is the
same problem as described there.

[1]: http://slurm.schedmd.com/faq.html#inc_plugin

Thanks,
- Trey

=============================

Trey Dockendorf
Systems Analyst I
Texas A&M University
Academy for Advanced Telecommunications and Learning Technologies
Phone: (979)458-2396
Email: [email protected]
Jabber: [email protected]

Reply via email to