[slurm-dev] Node not available due to future reservation?

Tim Donahue Thu, 22 Jun 2017 14:59:45 -0700

I have a very simple system, one controller, one server node. The nodeis up.

ubuntu@controller:~$ sinfo
PARTITION AVAIL  TIMELIMIT  NODES  STATE NODELIST
debug*       up   infinite      1   idle server1

I create a reservation containing the server node but having a starttime many days in advance:

ubuntu@controller:~$ scontrol show reservations -o
ReservationName=foo3 StartTime=2017-07-03T00:00:00EndTime=2017-07-03T01:00:00 Duration=01:00:00 Nodes=server1 NodeCnt=1CoreCnt=1 Features=(null) PartitionName=debug Flags= TRES=cpu=1Users=ubuntu Accounts=(null) Licenses=(null) State=INACTIVEBurstBuffer=(null) Watts=n/a
ubuntu@controller:~$


I then try to run a (very simple) job, but the job is queued:

ubuntu@controller:~$ srun hostname
srun: Required node not available (down, drained or reserved)
srun: job 630 queued and waiting for resources

squeue suggests the job is queued because the server node is not available:

ubuntu@controller:~$ squeue
JOBID PARTITION NAME USER ST TIME NODESNODELIST(REASON)629 debug hostname ubuntu PD 0:00 1(ReqNodeNotAvail, May be reserved for other job)


Is this the expected behavior and, if so, why?

Thanks

Tim Donahue

MIT / BU / MassOpenCloud

[slurm-dev] Node not available due to future reservation?

Reply via email to