Hello List, recently, I've stumbled across a weird problem: jobs that should be eligible for scheduling (qalter -w p reports a "possible assignment") are not actually started. I see lines like this in spool/qmaster/messages: 06/18/2013 16:06:55|schedu|owl-master1|P|PROF: scheduled in 102.740 (u 115.210 + s 6.910 = 122.120): 0 sequential, 76 parallel, 4457 orders, 873 H, 227 Q, 834 QA, 1806 J(qw), 425 J(r), 0 J(s), 0 J(h), 0 J(e), 1 J(x), 4452 J(all), 63 C, 7 ACL, 173 PE, 58 U, 3 D, 0 PRJ, 1 ST, 0 CKPT, 0 RU, 1 gMes, 0 jMes, 4457/5 pre-send, -100/-172/-557 pe-alg
Looking at common/schedule, I see STARTING lines for the same jobs repeated over and over (i.e. on each iteration). There is another weirdness involved here (I do not know to what extent it is related): node12-36 has gpu=0, so 151892 definitely should not be able to start there. BTW, the I've added gpu=0 after first noticing this problem with no gpu setting on the node at all. 151892:1:STARTING:1371564416:172860:P:openmp_fast:slots:8.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:slots:1.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:exclusive:1.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:h_data:52428800.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:gpu:1.000000 151892:1:STARTING:1371564416:172860:Q:[email protected]:slots:1.000000 151892:1:STARTING:1371564416:172860:L:max_slots_per_user:tgraen/////:1.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:slots:7.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:exclusive:7.000000 151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:h_data:367001600.000000 151892:1:STARTING:1371564416:172860:Q:[email protected]:slots:7.000000 151892:1:STARTING:1371564416:172860:L:max_slots_per_user:tgraen/////:7.000000 I am at a complete loss. This is on OGS/GE 2011.11. Regards, A. -- Ansgar Esztermann DV-Systemadministration Max-Planck-Institut für biophysikalische Chemie, Abteilung 105
smime.p7s
Description: S/MIME cryptographic signature
_______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
