As Moe mentioned you will need to set ReturnToService to 0 1 or 2, depending on the desired behavior.
From: andre roy [mailto:[email protected]] Sent: Monday, August 01, 2011 06:06 PM To: [email protected] <[email protected]> Subject: Re: [slurm-dev] Node state always down: low RealMemory Updating the node has changed its status to IDLE! thanks a bunch :) I'm surprised it doesn't automatically try to return to service when the slurmd daemon starts up. -- André ----- Original Message ----- From: [email protected] Sent: 08/01/11 04:58 PM To: [email protected] Subject: Re: [slurm-dev] Node state always down: low RealMemory Note the time stamp when the node was set down. It may never have been restored to service since then. Take a look at the configuration parameter ReturnToService in "man slurm.conf" and set appropriately. You can manually return the node to service with "scontrol update NodeName=ClusterNode0 State=Resume" > NodeName=ClusterNode0 Arch=i686 CoresPerSocket=1 > CPUAlloc=0 CPUErr=0 CPUTot=1 Features=(null) > OS=Linux RealMemory=2 Sockets=1 > State=DOWN ThreadsPerCore=1 TmpDisk=0 Weight=1 > Reason=Low RealMemory [slurm@2011-07-31T21:30:51] > I don't understand why the compute node is reporting low memory... > running >scontrol show slurm reports that the node has 1018 Mb > available to it and 480 Mb of disk space.
