Gaya Nadarajan <[email protected]> writes:

> Btw, it is version 6.2u5.
>
> Thanks,
> Gaya
>
> On 04/03/13 14:34, Gaya Nadarajan wrote:
>> Hi,
>>
>> When I resent the jobs using a one-second delay between them it
>> worked, I supplied the error log in my previous post, but I'm not
>> certain what went wrong where.

I wondered how you know it's "failing to acknowledge jobs sent from the
qmaster", which the log definitely didn't tell me.

>> Another detail, I used the DRMAA api
>> to connect to sge rather than qsub directly.

If you can provide a test case which shows the problem, I can try it on
the latest version, although not on an ancient OS.

>> Not sure which version but gridengine on Ubuntu 3.0

3.0?  Gosh.

>> Cheers,
>> Gaya
>>
>> On 01/03/13 11:05, Dave Love wrote:
>>> Gaya Nadarajan <[email protected]> writes:
>>>
>>>> Hi,
>>>>
>>>> I'm from Edinburgh but I'm not using the cluster here, I have a
>>>> private cluster set up for a project elsewhere. Anyhow, the problem is
>>>> associated with the execution daemon failing to acknowledge jobs sent
>>>> from the qmaster, causing some jobs to be 'lost'.
>>> How did you determine that, and in what version?  If it might apply to
>>> current SGE, please make a bug report with details (see below).
>>>
>>
>>

-- 
Community Grid Engine:  http://arc.liv.ac.uk/SGE/
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users
