[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Michael Kit Gilbert
Update: In the interest of helping out anyone else in the future who may have my problem, I'm posting what the solution to the problem. All I had to do was add the line JobSubmitPlugins=job_submit/require_timelimit to the slurm.conf. It would have saved so much time and trouble if this

[slurm-dev] Re: Slurm is refusing to establish a connection between nodes and controller

2015-03-19 Thread John Desantis
Felix, How does the routing table look on the controller? Is the IB network listed on the controller using the correct interface? John DeSantis 2015-03-19 10:48 GMT-04:00 Felix Willenborg felix.willenb...@uni-oldenburg.de: So i tried out installing the latest package (14.11.4-1) of slurm

[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Andy Riebs
Michael, Try running slurmctld -D which should result in output telling you what's going wrong. Andy On 03/19/2015 01:15 PM, Michael Kit Gilbert wrote: Newb question about plugins Sorry for the basic question, but I am new to slurm and am having some basic problems

[slurm-dev] Re: Slurm is refusing to establish a connection between nodes and controller

2015-03-19 Thread Felix Willenborg
So i tried out installing the latest package (14.11.4-1) of slurm with no success - unfortunately. I kept an eye on the compilation of the Infiniband Plugin, that it is loaded in the slurmd and that a acct_gathering.conf is available. Still, i have the same problem. I assume that i'm not

[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Michael Kit Gilbert
Thank you so much for the reply, Andy. Well, apparently there's a lot happening that may be causing the issue. First, I can't seem to get slurmctld running properly. When I run slurmctld -D, this is my output: slurmctld: error: Can't save state, create file /var/spool/slurm/last_config_lite.new

[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Michael Kit Gilbert
Update: So, I have figured out the problem with slurm not running properly. It had to do with my fstab file being incorrect and not mounting /var/spool correctly. Now I can start slurm correctly. However, when trying to run a job, slurm doesn't load the plugin properly, so it fails with the

[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Andy Riebs
OK, we (or at least I) have reached the point where you need to provide some more information: * What operating system and version? * What Slurm version? * What compiler? You apparently have some kind of build problem, as Slurm plugins are required to export a specific set of symbols;

[slurm-dev] Slurm versions 14.11.5 and 15.08.0-pre3 are now available

2015-03-19 Thread Moe Jette
Version 14.11.5 contains quite a few bug fixes generated over the past five weeks including two high impact bugs. There is a fix for the slurmdbd daemon aborting if a node is set to a DOWN state and it's reason field is NULL. The other important bug fix will prevent someone from being

[slurm-dev] Re: Newb question about plugins

2015-03-19 Thread Michael Kit Gilbert
Thanks again for the help! OS: CentOS 6.5 Slurm version: 14.11.2 Compiler: gcc 4.4.7 Getting rid of the plugstack.conf file allowed me to start running jobs again, but the plugin that I'm wanting to work doesn't appear to be enabled. There are a bunch of *.so plugin files in the /usr/lib64/slurm