Update: In the interest of helping out anyone else in the future who may
have my problem, I'm posting what the solution to the problem.
All I had to do was add the line
JobSubmitPlugins=job_submit/require_timelimit
to the slurm.conf. It would have saved so much time and trouble if this
Felix,
How does the routing table look on the controller?
Is the IB network listed on the controller using the correct interface?
John DeSantis
2015-03-19 10:48 GMT-04:00 Felix Willenborg felix.willenb...@uni-oldenburg.de:
So i tried out installing the latest package (14.11.4-1) of slurm
Michael,
Try running slurmctld -D which should result in output telling you
what's going wrong.
Andy
On 03/19/2015 01:15 PM, Michael Kit
Gilbert wrote:
Newb question about plugins
Sorry for the basic question, but I am new to slurm
and am having some basic problems
So i tried out installing the latest package (14.11.4-1) of slurm with
no success - unfortunately. I kept an eye on the compilation of the
Infiniband Plugin, that it is loaded in the slurmd and that a
acct_gathering.conf is available. Still, i have the same problem. I
assume that i'm not
Thank you so much for the reply, Andy. Well, apparently there's a lot
happening that may be causing the issue. First, I can't seem to get
slurmctld running properly. When I run slurmctld -D, this is my output:
slurmctld: error: Can't save state, create file
/var/spool/slurm/last_config_lite.new
Update: So, I have figured out the problem with slurm not running properly.
It had to do with my fstab file being incorrect and not mounting /var/spool
correctly.
Now I can start slurm correctly. However, when trying to run a job, slurm
doesn't load the plugin properly, so it fails with the
OK, we (or at least I) have reached the point where you need to
provide some more information:
* What operating system and version?
* What Slurm version?
* What compiler?
You apparently have some kind of build problem, as Slurm plugins are
required to export a specific set of symbols;
Version 14.11.5 contains quite a few bug fixes generated over the past
five weeks including two high impact bugs. There is a fix for the
slurmdbd daemon aborting if a node is set to a DOWN state and it's
reason field is NULL. The other important bug fix will prevent
someone from being
Thanks again for the help!
OS: CentOS 6.5
Slurm version: 14.11.2
Compiler: gcc 4.4.7
Getting rid of the plugstack.conf file allowed me to start running jobs
again, but the plugin that I'm wanting to work doesn't appear to be enabled.
There are a bunch of *.so plugin files in the /usr/lib64/slurm