Raminderjeet Singh created AIRAVATA-1354:
--------------------------------------------

             Summary: Job monitor for Stampede unknow status
                 Key: AIRAVATA-1354
                 URL: https://issues.apache.org/jira/browse/AIRAVATA-1354
             Project: Airavata
          Issue Type: Improvement
          Components: GFac
            Reporter: Raminderjeet Singh


We should using experiment id to name the jobs for unique identifier and then 
use that job name to identify if the job get to unknown status. If the job 
still is in unknown state we should check in working directory for stdout/err 
and make corrective action to correct the UNKNOWN statues. Same logic will be 
useful for job recovery if Airavata server restart.  



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to