On 16/04/16 21:51, John DeSantis wrote:

> Anyways, we have experienced a random(?) slurmctld failure resulting in
> a segfault twice this week. 

15.08.4 is pretty old (last November), 15.08.10 was released April 6th
and includes a fix for a race condition that could crash slurmctld.

http://schedmd.com/#154

It also looks like there is a slurmctld crash fix coming in 15.08.11:

https://github.com/SchedMD/slurm/blob/slurm-15.08/NEWS

All the best!
Chris
-- 
 Christopher Samuel        Senior Systems Administrator
 VLSCI - Victorian Life Sciences Computation Initiative
 Email: [email protected] Phone: +61 (0)3 903 55545
 http://www.vlsci.org.au/      http://twitter.com/vlsci

Reply via email to