Did you configure the RebootProgram parameter in slurm.conf and is that script working? Remember: this script is run on the compute node, therefore it must be available on the compute node and must be executable.
Am 01.03.2016 um 01:54 schrieb Christopher Samuel: > > Hi folks, > > We're at the tail end of a planned outage for a bunch of work, and we > have a reservation on all our compute nodes on all our clusters. > > I thought I would do an "scontrol reboot" to reboot our compute nodes > into a new xCAT osimage and then noticed that nothing happened. > > All the nodes had a reservation that was created with: > > scontrol create reservation starttime=2016-02-23T14:00 duration=7-0 \ > accounts=ibm,vlsci flags=maint,overlap nodes=ALL \ > reservationName=GPFS_RT19363 > > So it had the MAINT flag on it, and given that scontrol reboot marked > all nodes as a state of MAINT anyway I'm bemused that it didn't then > reboot the nodes. > > Anyone seen this before? > > We're on Slurm 15.08.8. > > All the best, > Chris >
