On 2 Mar 2011, at 18:59, Reuti wrote:

> Hi,
> 
> On 02.03.2011 at 19:37, Chris Jewell wrote:
> 
>> I was wondering if it was possible to get GE to output an error message to 
>> the stderr file in response to a job being killed due to it exceeding a 
>> resource request?  
> 
> Yep, it's sometimes not easy to investigate why a job was killed, as you have 
> to check the messages file of the appropriate nodes. As you have only SMP 
> jobs in the parallel case, there is only one machine to check, and its entry 
> can be attached to the email which is sent to the user. Please find attached 
> a mail-wrapper which uses a local messages file, but it can be adjusted to 
> reflect your path. If you hit a race condition where the email is sent 
> before there is an entry in the messages file, a `sleep 5` or similar 
> should help.

Thanks for that, Reuti.  I'm a little confused as to where to include it in the 
config -- do you mean to replace the mailer on the host with it?  Would it 
be possible to write to the job's stderr file from an epilog script instead, 
harvesting the same line from the messages file, I wonder?
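
To make the epilog idea concrete, here is a rough sketch of what I have in 
mind, not something I have running. It assumes the epilog environment provides 
JOB_ID and SGE_STDERR_PATH, and that the execd log lines mention the job as 
"job <id>". The EXECD_MESSAGES variable and its default path are made up for 
illustration and would need to point at the real messages file on the node.

```python
#!/usr/bin/env python3
"""Epilog sketch: copy execd log lines about this job into its stderr file."""
import os
import re


def find_job_lines(lines, job_id):
    """Return lines mentioning 'job <job_id>', with an optional '.<task>' suffix."""
    # The trailing \b keeps job 123 from matching job 1234.
    pattern = re.compile(r"\bjob %s(\.\d+)?\b" % re.escape(str(job_id)))
    return [line.rstrip("\n") for line in lines if pattern.search(line)]


def append_to_stderr(stderr_path, lines, prefix="[epilog] "):
    """Append each line to the job's stderr file, marked with a prefix."""
    with open(stderr_path, "a") as out:
        for line in lines:
            out.write(prefix + line + "\n")


def main():
    job_id = os.environ.get("JOB_ID")
    stderr_path = os.environ.get("SGE_STDERR_PATH")
    # EXECD_MESSAGES is a hypothetical variable; the default path is a placeholder.
    messages = os.environ.get("EXECD_MESSAGES", "/var/spool/sge/messages")
    if not job_id or not stderr_path:
        return  # not running as an epilog, nothing to do
    try:
        with open(messages, errors="replace") as fh:
            hits = find_job_lines(fh, job_id)
    except OSError:
        return  # messages file missing or unreadable: leave stderr untouched
    if hits:
        append_to_stderr(stderr_path, hits)


if __name__ == "__main__":
    main()
```

The account running the epilog would need read access to the messages file 
and write access to the stderr file, and the same `sleep 5` trick may be 
needed if the entry is logged late.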

Chris


--
Dr Chris Jewell
Department of Statistics
University of Warwick
Coventry
CV4 7AL
UK
Tel: +44 (0)24 7615 0778
