Am 17.03.2013 um 07:22 schrieb Joseph Farran: > On 1/4/2013 10:37 AM, Reuti wrote: >> Am 02.01.2013 um 05:08 schrieb Joseph Farran: >> >>> Hello Reuti. >>> >>> Yes, the job(s) are not suspending (S) as they normally do. So it's not >>> the queue, but the jobs. >> But is the queue in suspended state (qstat -f)? > > Sorry Reuti, missed your question. > > Yes, the queue is SUSPENDED but jobs continue to run: Here is one example: > > [email protected] BIP 0/4/64 11.21 lx-amd64 S > 242709 0.00355 CMAPNN mengfant r 03/15/2013 02:27:23 2 20 > 242709 0.00355 CMAPNN mengfant r 03/15/2013 02:27:23 2 33
Were these slave tasks of a parallel job? -- Reuti > Any idea why it keeps forgetting to suspend? Only happens once in a while > but it overloads the nodes when it does happen. > > > >> >> -- Reuti >> >> >>> Normally as soon as 1 or more core jobs enters the node through the queue, >>> the subordinate jobs suspend immediately. Once is a while, the jobs that >>> go in through the subordinate queue do not suspend as they should. >>> >>> On 1/1/2013 7:04 AM, Reuti wrote: >>>> Engine Forgets and does not suspend and the node is overloaded. >>>> The queue is not going into the "S" state or the jobs therein are just not >>>> suspended? >>>> >>>> -- Reuti >>>> >> > _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
