Hello List,

recently, I've stumbled across a weird problem: jobs that should be eligible 
for scheduling (qalter -w p reports a "possible assignment") are not actually 
started.
I see lines like this in spool/qmaster/messages:
06/18/2013 16:06:55|schedu|owl-master1|P|PROF: scheduled in 102.740 (u 115.210 
+ s 6.910 = 122.120): 0 sequential, 76 parallel, 4457 orders, 873 H, 227 Q, 834 
QA, 1806 J(qw), 425 J(r), 0 J(s), 0 J(h), 0 J(e), 1 J(x), 4452 J(all), 63 C, 7 
ACL, 173 PE, 58 U, 3 D, 0 PRJ, 1 ST, 0 CKPT, 0 RU, 1 gMes, 0 jMes, 4457/5 
pre-send, -100/-172/-557 pe-alg

Looking at common/schedule, I see STARTING lines for the same jobs repeated 
over and over (i.e. on each iteration).
There is another weirdness involved here (I do not know to what extent it is 
related): node12-36 has gpu=0, so 151892 definitely should not be able to start 
there.
BTW, the I've added gpu=0 after first noticing this problem with no gpu setting 
on the node at all.

151892:1:STARTING:1371564416:172860:P:openmp_fast:slots:8.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:slots:1.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:exclusive:1.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:h_data:52428800.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:gpu:1.000000
151892:1:STARTING:1371564416:172860:Q:[email protected]:slots:1.000000
151892:1:STARTING:1371564416:172860:L:max_slots_per_user:tgraen/////:1.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:slots:7.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:exclusive:7.000000
151892:1:STARTING:1371564416:172860:H:node12-36.cm.cluster:h_data:367001600.000000
151892:1:STARTING:1371564416:172860:Q:[email protected]:slots:7.000000
151892:1:STARTING:1371564416:172860:L:max_slots_per_user:tgraen/////:7.000000

I am at a complete loss.
This is on OGS/GE 2011.11.


Regards,

A.
-- 
Ansgar Esztermann
DV-Systemadministration
Max-Planck-Institut für biophysikalische Chemie, Abteilung 105

Attachment: smime.p7s
Description: S/MIME cryptographic signature

_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to