As Moe mentioned you will need to set ReturnToService to 0 1 or 2, depending on 
the desired behavior.


From: andre roy [mailto:[email protected]]
Sent: Monday, August 01, 2011 06:06 PM
To: [email protected] <[email protected]>
Subject: Re: [slurm-dev] Node state always down: low RealMemory

Updating the node has changed its status to IDLE!

thanks a bunch :)

I'm surprised it doesn't automatically try to return to service when the slurmd 
daemon starts up.

--
André




----- Original Message -----

From: [email protected]

Sent: 08/01/11 04:58 PM

To: [email protected]

Subject: Re: [slurm-dev] Node state always down: low RealMemory



Note the time stamp when the node was set down. It may never have been
restored to service since then. Take a look at the configuration
parameter ReturnToService in "man slurm.conf" and set appropriately.
You can manually return the node to service with "scontrol update
NodeName=ClusterNode0 State=Resume"

>  NodeName=ClusterNode0 Arch=i686 CoresPerSocket=1
>  CPUAlloc=0 CPUErr=0 CPUTot=1 Features=(null)
>  OS=Linux RealMemory=2 Sockets=1
>  State=DOWN ThreadsPerCore=1 TmpDisk=0 Weight=1
>  Reason=Low RealMemory [slurm@2011-07-31T21:30:51]
> I don't understand why the compute node is reporting low memory...
> running >scontrol show slurm reports that the node has 1018 Mb
> available to it and 480 Mb of disk space.


Reply via email to