Are you looking at the man page on the SchedMD website, or on your computer? If you're looking at the website, those pages are for the latest version and may not match what you have installed, so this could be a feature in a later version tha 18.08.

--
Prentice

On 8/7/20 11:43 AM, Hanby, Mike wrote:

Howdy, (Slurm 18.08)

We have a bunch of node that we've updated to "scontrol reboot ASAP".

We'd like to cancel a few of those. From the man page, it's suggested that either of the following should work, however both report the same error " slurm_update error: Invalid node state specified":

scontrol cancel_reboot c01

or

scontrol Update NodeName=c01 State=CANCEL_REBOOT

Here's the 'scontrol show node c01' info for reference:

NodeName=c01 Arch=x86_64 CoresPerSocket=12

CPUAlloc=7 CPUTot=24 CPULoad=7.04

AvailableFeatures=(null)

ActiveFeatures=(null)

Gres=(null)

NodeAddr=c0115 NodeHostName=c01 Version=18.08

   OS=Linux 3.10.0-1062.9.1.el7.x86_64 #1 SMP Mon Dec 2 08:31:54 EST 2019

RealMemory=191877 AllocMem=6536 FreeMem=176717 Sockets=2 Boards=1

State=MIXED+DRAIN ThreadsPerCore=1 TmpDisk=887366 Weight=1 Owner=N/A MCS_label=N/A

Partitions=interactive,short,long,medium,express

BootTime=2020-07-08T23:16:27 SlurmdStartTime=2020-07-08T23:32:05

CfgTRES=cpu=24,mem=191877M,billing=24

AllocTRES=cpu=7,mem=6536M

CapWatts=n/a

CurrentWatts=0 LowestJoules=0 ConsumedJoules=0

ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s

Reason=Reboot ASAP [root@2020-08-06T10:29:22]

Any thoughts as to how to cancel the reboot?

----------------

Mike Hanby

mhanby @ uab.edu

Systems Analyst III - Enterprise

IT Research Computing Services

The University of Alabama at Birmingham

--
Prentice Bisbal
Lead Software Engineer
Research Computing
Princeton Plasma Physics Laboratory
http://www.pppl.gov

Reply via email to