Re: [gridengine users] Job finishes correctly but master is not notified

2018-04-10 Thread Paul
William Hay" > To: "Paul Paul" > Cc: users@gridengine.org > Subject: Re: [gridengine users] Job finishes correctly but master is not > notified > > On Thu, Apr 05, 2018 at 03:38:18PM +0200, Paul Paul wrote: > > William, > > > > Thanks for yo

Re: [gridengine users] Job finishes correctly but master is not notified

2018-04-09 Thread Paul Paul
ul Paul" > Cc: users@gridengine.org > Subject: Re: [gridengine users] Job finishes correctly but master is not > notified > > On Thu, Apr 05, 2018 at 03:38:18PM +0200, Paul Paul wrote: > > William, > > > > Thanks for your reply. > > > > In the &#

Re: [gridengine users] Job finishes correctly but master is not notified

2018-04-05 Thread William Hay
On Thu, Apr 05, 2018 at 03:38:18PM +0200, Paul Paul wrote: > William, > > Thanks for your reply. > > In the 'messages' file of the exec host, there is nothing (the last message > was 2 weeks ago). Might be worth increasing the loglevel to get more info about what is going on there. William

Re: [gridengine users] Job finishes correctly but master is not notified

2018-04-05 Thread Paul Paul
e to run job: failed receiving gdi request response for mid=1 (got syncron message receive timeout error)." so it might help for this too. Paul. > Sent: Thursday, April 05, 2018 at 8:20 AM > From: "William Hay" > To: "Paul Paul" > Cc: users@gridengine.org > Sub

Re: [gridengine users] Job finishes correctly but master is not notified

2018-04-05 Thread William Hay
On Thu, Apr 05, 2018 at 09:46:23AM +0200, Paul Paul wrote: > Hello, > > We're using SGE 8.1.9 and randomly, we have jobs that finish with success > (our jobs logs confirm this) but the master is not notified. > On the compute, all the folders related to such a job are still here, > correctly fil

[gridengine users] Job finishes correctly but master is not notified

2018-04-05 Thread Paul Paul
Hello, We're using SGE 8.1.9 and randomly, we have jobs that finish with success (our jobs logs confirm this) but the master is not notified. On the compute, all the folders related to such a job are still here, correctly filled: trace file: ... 04/04/2018 21:50:13 [300:38328]: now running with