Hm, I'm not sure how I can help then but I do have a separate script configured as we have problems with Lustre not being quick enough on a reboot. So I first unmount Lustre, remove Infiniband modules and then do a "reboot -f".
Am 01.03.2016 um 10:49 schrieb Chris Samuel: > > On Mon, 29 Feb 2016 11:50:18 PM Uwe Sauter wrote: > >> Did you configure the RebootProgram parameter in slurm.conf and is that >> script working? Remember: this script is run on the compute node, therefore >> it must be available on the compute node and must be executable. > > Yes, all our clusters are configured with: > > RebootProgram=/sbin/reboot > > It certainly works, as that's how I rebooted the compute nodes when > "scontrol reboot" wouldn't. ;-) > > There's nothing in our syslog, slurmctld.log or slurmd.log's that mentions > anything related to the "scontrol reboot". > > All the best, > Chris >
