Slurm version 16.05.1 is now available and includes about 40 bug fixes
developed over the past month as listed below.
Slurm downloads are available from:
http://www.schedmd.com/#repos
* Changes in Slurm 16.05.1
==========================
-- Fix __cplusplus macro in spank.h to allow compilation with C++.
-- Fix compile issue with older glibc < 2.12
-- Fix for starting batch step with mpi/pmix plugin.
-- Fix for "scontrol -dd show job" with respect to displaying the
specific
CPUs allocated to a job on each node. Prior logic would only display
the CPU information for the first node in the job allocation.
-- Print correct return code on failure to update active node features
through sview.
-- Allow QOS timelimit to override partition timelimit when
EnforcePartLimits
is set to all/any.
-- Make it so qsub will do a "basename" on a wrapped command for the
output
and error files.
-- Fix issue where slurmd could core when running the ipmi energy
plugin.
-- Documentation - clean up typos.
-- Add logic so that slurmstepd can be launched under valgrind.
-- Increase buffer size to read /proc/*/stat files.
-- Fix for tracking job resource allocation when slurmctld is
reconfigured
while Cray Node Health Check (NHC) is running. Previous logic would
fail to
record the job's allocation then perform release operation upon NHC
completion, resulting in underflow error messages.
-- Make "scontrol show daemons" work with long node names.
-- CRAY - Collect energy using a uint64_t instead of uint32_t.
-- Fix incorrect if statements when determining if the user has a
default
account or wckey.
-- Prevent job stuck in configuring state if slurmctld daemon restarted
while
PrologSlurmctld is running. Also re-issue burst_buffer/pre-load
operation
as needed.
-- Correct task affinity support for FreeBSD.
-- Fix for task affinity on KNL in SNC2/Flat mode.
-- Recalculate a job's memory allocation after node reboot if job
requests all
of a node's memory and FastSchedule=0 is configured. Intel KNL
memory size
can change on reboot with various MCDRAM modes.
-- Fix small memory leak when printing HealthCheckNodeState.
-- Eliminate memory leaks when AuthInfo is configured.
-- Improve sdiag output description in man page.
-- Cray/capmc_resume script modify a node's features (as needed) when
the
reinit (reboot) command is issued rather than wait for the nodes to
change
to the "on" state.
-- Correctly print ranges when using step values in job arrays.
-- Allow from file names / paths over 256 characters when launching
steps,
as well as spaces in the executable name.
-- job_submit.license.lua example modified to send message back to
user.
-- Document job --mem=0 option means all memory on a node.
-- Set SLURM_JOB_QOS environment variable to QOS name instead of
description.
-- knl_cray.conf file option of CnselectPath added.
-- node_features/knl_cray plugin modified to get current node NUMA and
MCDRAM
modes using cnselect command rather than capmc command.
-- liblua - add SLES12 paths to runtime search list.
-- Fix qsub default output and error files for task arrays.
-- Fix qsub to set job_name correctly when wrapping a script (-b y)
-- Cray - set EnforcePartLimits=any in slurm.conf template.