Eroma created AIRAVATA-2944:
-------------------------------

             Summary: Job failures due to wall-time exceed should display/send 
the failure reason to users
                 Key: AIRAVATA-2944
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-2944
             Project: Airavata
          Issue Type: Improvement
          Components: helix implementation
    Affects Versions: 0.18
         Environment: https://staging.ultrascan.scigap.org
            Reporter: Eroma
            Assignee: Dimuthu Upeksha
             Fix For: 0.18


When jobs fail due to wall time exceed the STDERR has message 'slurmstepd: 
error: *** JOB 2305055 ON c413-043 CANCELLED AT 2018-10-29T02:46:27 DUE TO TIME 
LIMIT ***' 

and

the job emails comes with subject '....Run time 13:00:11, TIMEOUT, ExitCode 0'

The email subject can be processed and display/send the TIMOUT as the reson for 
job FAIL.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to