Slurm version 2.4.4 is now available from

http://www.schedmd.com/#repos

Changes since version 2.4.3 are almost all bug fixes for IBM  
Bluegene/Q systems as listed below.

We plan to tag version 2.5.0-rc1 (release candidate 1) next week, just  
before the SC12 conference. For those of you attending SC12, there is  
a Slurm BOF on Thursday at 12:15 and will be a Slurm booth on the  
exhibit floor.

* Changes in SLURM 2.4.4
========================
  -- BGQ - minor fix to make build work in emulated mode.
  -- BGQ - Fix if large block goes into error and the next highest  
priority jobs
     are planning on using the block.  Previously it would fail those jobs
     erroneously.
  -- BGQ - Fix issue when a cnode going to an error (not SoftwareError) state
     with a job running or trying to run on it.
  -- Execute slurm_spank_job_epilog when there is no system Epilog configured.
  -- Fix for srun --test-only to work correctly with timelimits
  -- BGQ - If a job goes away while still trying to free it up in the
     database, and the job is running on a small block make sure we free up
     the correct node count.
  -- BGQ - Logic added to make sure a job has finished on a block before it is
     purged from the system if its front-end node goes down.
  -- Modify strigger so that a filter option of "--user=0" is supported.
  -- Correct --mem-per-cpu logic for core or socket allocations with multiple
     threads per core.
  -- Fix for older < glibc 2.4 systems to use euidaccess() instead of  
eaccess().
  -- BLUEGENE - Do not alter a pending job's node count when changing it's
     partition.
  -- BGQ - Add functionality to make it so we track the actions on a block.
     This is needed for when a free request is added to a block but there are
     jobs finishing up so we don't start new jobs on the block since they will
     fail on start.
  -- BGQ - Fixed InactiveLimit to work correctly to avoid scenarios where a
     user's pending allocation was started with srun and then for some reason
     the slurmctld was brought down and while it was down the srun was removed.
  -- Fixed InactiveLimit math to work correctly
  -- BGQ - Add logic to make it so blocks can't use a midplane with a nodeboard
     in error for passthrough.
  -- BGQ - Make it so if a nodeboard goes in error any block using  
that midplane
     for passthrough gets removed on a dynamic system.
  -- BGQ - Fix for printing realtime server debug correctly.
  -- BGQ - Cleaner handling of cnode failures when reported through the runjob
     interface instead of through the normal method.
  -- smap - spread node information across multiple lines for larger systems.
  -- Cray - Defer salloc until after PrologSlurmctld completes.
  -- Correction to slurmdbd communications failure handling logic, incorrect
     error codes returned in some cases.

Reply via email to