Eroma created AIRAVATA-2944:
-------------------------------
Summary: Job failures due to wall-time exceed should display/send
the failure reason to users
Key: AIRAVATA-2944
URL: https://issues.apache.org/jira/browse/AIRAVATA-2944
Project: Airavata
Issue Type: Improvement
Components: helix implementation
Affects Versions: 0.18
Environment: https://staging.ultrascan.scigap.org
Reporter: Eroma
Assignee: Dimuthu Upeksha
Fix For: 0.18
When jobs fail due to wall time exceed the STDERR has message 'slurmstepd:
error: *** JOB 2305055 ON c413-043 CANCELLED AT 2018-10-29T02:46:27 DUE TO TIME
LIMIT ***'
and
the job emails comes with subject '....Run time 13:00:11, TIMEOUT, ExitCode 0'
The email subject can be processed and display/send the TIMOUT as the reson for
job FAIL.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)