Slurm version 15.08.10 is now available. It includes a fix for a race
condition which can result in an invalid memory reference, likely
causing the slurmctld daemon to crash. Other changes described below.
Slurm downloads are available from
http://www.schedmd.com/#repos
* Changes in Slurm 15.08.10
===========================
-- Fix issue where if a slurmdbd rollup lasted longer than 1 hour the
rollup would effectively never run again.
-- Make error message in the pmi2 code to debug as the issue can be
expected
and retries are done making the error message a little misleading.
-- Power/cray: Don't specify NID list to Cray APIs. If any of those
nodes are
not in a ready state, the API returned an error for ALL nodes rather
than
valid data for nodes in ready state.
-- Fix potential divide by zero when tree_width=1.
-- checkpoint/blcr plugin: Fix memory leak.
-- If using PrologFlags=contain: Don't launch the extern step if a job
is
cancelled while launching.
-- Remove duplicates from AccountingStorageTRES
-- Fix backfill scheduler race condition that could cause invalid
pointer in
select/cons_res plugin. Bug introduced in 15.08.9.
-- Avoid double calculation on partition QOS if the job is using the
same QOS.
-- Do not change a job's time limit when updating unrelated field in a
job.