I have a number of jobs hanging and it may be due to reservations.
showbf -v give me.

   [root@server bin]# ./showbf -v
   backfill window (user: 'root' group: 'root' partition: ALL) Fri Jul
   18 10:58:04

   1265 procs available for    1:15:17:58
   1264 procs available for    2:15:06:02
   1262 procs available with no timelimit


   node localhost is unavailable (state 'Down')
   node scotia3 is blocked by reservation NONE in   INFINITY
   node scotia256 is blocked by reservation NONE in   INFINITY
   node scotia4 is blocked by reservation 213922 in   INFINITY
   node scotia5 is blocked by reservation NONE in   INFINITY
   node scotia6 is blocked by reservation NONE in   INFINITY
   node scotia7 is blocked by reservation NONE in   INFINITY
   node scotia8 is blocked by reservation NONE in   INFINITY
   node scotia9 is blocked by reservation NONE in   INFINITY
   node scotia10 is blocked by reservation NONE in   INFINITY
   node scotia11 is blocked by reservation NONE in   INFINITY
   node scotia12 is blocked by reservation NONE in   INFINITY
   node scotia13 is blocked by reservation NONE in   INFINITY
   node scotia14 is blocked by reservation NONE in   INFINITY
   node scotia15 is blocked by reservation NONE in   INFINITY
   node scotia16 is blocked by reservation NONE in   INFINITY
   node scotia17 is blocked by reservation NONE in   INFINITY
   .........
   node wattson11 is blocked by reservation NONE in   INFINITY
   node wattson12 is blocked by reservation NONE in   INFINITY
   node p218inst34 is blocked by reservation NONE in   INFINITY


However the hosts report no reservations.

   [root@server bin]# ./checknode scotia3
   checking node scotia3
   State:      Idle  (in current state for 12:42:32)
   Configured Resources: PROCS: 4  MEM: 3829M  SWAP: 11G  DISK: 1M
   Utilized   Resources: SWAP: 197M
   Dedicated  Resources: [NONE]
   Opsys:         linux  Arch:      [NONE]
   Speed:      1.00  Load:       0.000
   Network:    [DEFAULT]
   Features:   [scotia]
   Attributes: [Batch]
   Classes:    [gangagpu 0:4][lab218 4:4][inti 4:4][wattson 4:4][ladon
   4:4][titan 4:4][scotia 4:4][ganga 4:4]
   Total Time:   INFINITY  Up:   INFINITY (90.70%)  Active: 4:40:45 (0.16%)
   Reservations:
   NOTE:  no reservations on node

   [root@server bin]# ./checknode wattson11
   checking node wattson11
   State:      Idle  (in current state for 15:18:55)
   Configured Resources: PROCS: 12  MEM: 11G  SWAP: 27G  DISK: 1M
   Utilized   Resources: SWAP: 592M
   Dedicated  Resources: [NONE]
   Opsys:         linux  Arch:      [NONE]
   Speed:      1.00  Load:       0.000
   Network:    [DEFAULT]
   Features:   [wattson]
   Attributes: [Batch]
   Classes:    [gangagpu 0:12][lab218 12:12][inti 12:12][wattson
   12:12][ladon 12:12][titan 12:12][scotia 12:12][ganga 12:12]
   Total Time: 1:02:38:41  Up: 1:01:22:09 (95.21%)  Active: 7:00:33
   (26.31%)
   Reservations:
   NOTE:  no reservations on node


Please any thoughts....
Eric

--

-Eric D. Prescott
Sr. Systems Administrator
111G IST Bldg.
Penn State Department of Computer Science and Engineering
[email protected]
Office: 814-863-1142

_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers

Reply via email to