Gaya Nadarajan <[email protected]> writes:

> Btw, it is version 6.2u5.
>
> Thanks,
> Gaya
>
> On 04/03/13 14:34, Gaya Nadarajan wrote:
>> Hi,
>>
>> When I resent the jobs using a one-second delay between them it
>> worked, I supplied the error log in my previous post, but I'm not
>> certain what went wrong where.

I wondered how you know it's "failing to acknowledge jobs sent from the
qmaster", which the log definitely didn't tell me.

>> Another detail, I used the DRMAA api
>> to connect to sge rather than qsub directly.

If you can provide a test case which shows the problem, I can try it on
the latest version, although not on an ancient OS.

>> Not sure which version but gridengine on Ubuntu 3.0

3.0?  Gosh.

>> Cheers,
>> Gaya
>>
>> On 01/03/13 11:05, Dave Love wrote:
>>> Gaya Nadarajan <[email protected]> writes:
>>>
>>>> Hi,
>>>>
>>>> I'm from Edinburgh but I'm not using the cluster here, I have a
>>>> private cluster set up for a project elsewhere. Anyhow, the problem is
>>>> associated with the execution daemon failing to acknowledge jobs sent
>>>> from the qmaster, causing some jobs to be 'lost'.
>>> How did you determine that, and in what version?  If it might apply to
>>> current SGE, please make a bug report with details (see below).
>>>
>>
>>

-- 
Community Grid Engine:  http://arc.liv.ac.uk/SGE/
_______________________________________________
users mailing list
[email protected]
https://gridengine.org/mailman/listinfo/users

Reply via email to