We've noticed here that if the reservation has the maint flag set, once the node becomes idle it goes into state maint and resets the reboot flag without rebooting the node. However, if you create a reservation and set the flag ignore_jobs, it will do roughly the same thing as with the maint flag with exception to updating the node state to maint, and when the node becomes idle it will reboot.
-----Original Message----- From: Ade Fewings [mailto:[email protected]] Sent: Tuesday, March 01, 2016 7:23 AM To: slurm-dev <[email protected]> Subject: [slurm-dev] Re: scontrol reboot won't reboot reserved nodes? Yes, we've seen this on occasion as well. Seems to be something in interaction of node reboot and node state (drained, maint, etc.) - but I haven't quite managed to get my head round it yet. Sorry can't be more help. ~~ Ade ________________________________________ From: Chris Samuel <[email protected]> Sent: 01 March 2016 09:48 To: slurm-dev Subject: [slurm-dev] Re: scontrol reboot won't reboot reserved nodes? On Mon, 29 Feb 2016 11:50:18 PM Uwe Sauter wrote: > Did you configure the RebootProgram parameter in slurm.conf and is > that script working? Remember: this script is run on the compute node, > therefore it must be available on the compute node and must be executable. Yes, all our clusters are configured with: RebootProgram=/sbin/reboot It certainly works, as that's how I rebooted the compute nodes when "scontrol reboot" wouldn't. ;-) There's nothing in our syslog, slurmctld.log or slurmd.log's that mentions anything related to the "scontrol reboot". All the best, Chris -- Christopher Samuel Senior Systems Administrator VLSCI - Victorian Life Sciences Computation Initiative Email: [email protected] Phone: +61 (0)3 903 55545 http://www.vlsci.org.au/ http://twitter.com/vlsci ________________________________ [HPC Wales - www.hpcwales.co.uk] <http://www.hpcwales.co.uk> [ISO 9001] <http://www.bsigroup.co.uk/en-GB/iso-9001-quality-management/> [HPC Wales on Twitter]<https://twitter.com/#!/hpcwales> [HPC Wales on LinkedIn] <http://www.linkedin.com/company/2535987> ________________________________ The contents of this email and any files transmitted with it are confidential and intended solely for the named addressee only. Unless you are the named addressee (or authorised to receive this on their behalf) you may not copy it or use it, or disclose it to anyone else. If you have received this email in error, please notify the sender by email or telephone. All emails sent by High Performance Computing Wales have been checked using an Anti-Virus system. We would advise you to run your own virus check before opening any attachments received as we will not in any event accept any liability whatsoever, once an email and/or attachment is received. High Performance Computing Wales is a private limited company incorporated in Wales on 8 March 2010 as company number 07181701. Our registered office is at Ty Menai, Ffordd Penlan, Parc Menai Business Park, Bangor, Gwynedd. LL57 4HJ. UK. High Performance Computing Wales is part funded by the European Regional Development Fund through the Welsh Government.
