Hi Guys,

I'm currently experiencing a problem with a reservation. The job was
submitted with the appropriate --reservation parameter, the reservation is
active, and all nodes in the reservation are in the idle state.

Despite these conditions, the job remains in the pending state. You can
find the output of the scontrol show commands below.

Can you advise me where I can find the code responsible for running jobs
in a reservation? I'm using the backfill scheduler.


root@zdog:~# scontrol show job 33301
JobId=33301 Name=bash
   UserId=um(5830) GroupId=icm-meteo(105)
   Priority=58 Account=root QOS=normal
   JobState=PENDING Reason=Resources Dependency=(null)
   Requeue=1 Restarts=0 BatchFlag=0 ExitCode=0:0
   RunTime=00:00:00 TimeLimit=7-00:00:00 TimeMin=N/A
   SubmitTime=2013-11-05T09:35:32 EligibleTime=2013-11-05T09:35:32
   StartTime=Unknown EndTime=Unknown
   PreemptTime=None SuspendTime=None SecsPreSuspend=0
   Partition=hydra AllocNode:Sid=hpc:27470
   ReqNodeList=(null) ExcNodeList=(null)
   NodeList=(null)
   NumNodes=1 NumCPUs=1 CPUs/Task=1 ReqS:C:T=*:*:*
   MinCPUsNode=1 MinMemoryNode=0 MinTmpDiskNode=0
   Features=(null) Gres=(null) Reservation=meteo
   Shared=OK Contiguous=0 Licenses=(null) Network=(null)
   Command=/bin/bash
   WorkDir=/icm/home/um

root@zdog:~# scontrol show res meteo
ReservationName=meteo StartTime=2013-10-31T14:56:10 EndTime=2013-11-14T14:56:10 Duration=14-00:00:00
   Nodes=wn[2085,2091,2093,2095,2097] NodeCnt=5 CoreCnt=60 Features=intelx5660 PartitionName=hydra Flags=
   Users=um Accounts=(null) Licenses=(null) State=ACTIVE

root@zdog:~# scontrol show node wn2085
NodeName=wn2085 Arch=x86_64 CoresPerSocket=6
   CPUAlloc=0 CPUErr=0 CPUTot=12 CPULoad=0.04 Features=intelx5660,westmere,ib,qcg,noht
   Gres=(null)
   NodeAddr=wn2085 NodeHostName=wn2085
   OS=Linux RealMemory=24146 AllocMem=0 Sockets=2 Boards=1
   State=IDLE ThreadsPerCore=1 TmpDisk=442266 Weight=20
   BootTime=2013-10-31T14:26:42 SlurmdStartTime=2013-11-04T13:36:31
   CurrentWatts=0 LowestJoules=0 ConsumedJoules=0
   ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s


cheers,
marcin
