Eroma created AIRAVATA-2944: ------------------------------- Summary: Job failures due to wall-time exceed should display/send the failure reason to users Key: AIRAVATA-2944 URL: https://issues.apache.org/jira/browse/AIRAVATA-2944 Project: Airavata Issue Type: Improvement Components: helix implementation Affects Versions: 0.18 Environment: https://staging.ultrascan.scigap.org Reporter: Eroma Assignee: Dimuthu Upeksha Fix For: 0.18
When jobs fail due to wall time exceed the STDERR has message 'slurmstepd: error: *** JOB 2305055 ON c413-043 CANCELLED AT 2018-10-29T02:46:27 DUE TO TIME LIMIT ***' and the job emails comes with subject '....Run time 13:00:11, TIMEOUT, ExitCode 0' The email subject can be processed and display/send the TIMOUT as the reson for job FAIL. -- This message was sent by Atlassian JIRA (v7.6.3#76005)