[
https://issues.apache.org/jira/browse/AIRAVATA-2956?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16698201#comment-16698201
]
Dimuthu Upeksha commented on AIRAVATA-2956:
-------------------------------------------
Fixed in
[https://github.com/apache/airavata/commit/f57cc7e57aa8f940dac4599e80dbec1715219ad5]
[https://github.com/apache/airavata/commit/e6fe2053313cf767283b15b1f5f07e382814ec61]
[https://github.com/apache/airavata/commit/96cb11b2b787ad8ea175103f540464c6e191420a]
https://github.com/apache/airavata/commit/b7b08422d2bacbcfdf2dd9256a780e279481427c
> Possible race condition in job monitoring
> -----------------------------------------
>
> Key: AIRAVATA-2956
> URL: https://issues.apache.org/jira/browse/AIRAVATA-2956
> Project: Airavata
> Issue Type: Bug
> Components: helix implementation
> Reporter: Dimuthu Upeksha
> Assignee: Dimuthu Upeksha
> Priority: Major
>
> When Job submission task submits a job to a compute resource, it returns a
> job id and then it is saved in a zookeeper path for post workflow execution.
> But in some cases, job completes before those metadata is saved in zookeeper
> and then post workflow fails.
> 018-11-21 18:15:55,783 [main] INFO o.a.a.h.i.w.PostWorkflowManager -
> Processing job result of job id 9839 sent by EmailBasedProducer
> 2018-11-21 18:15:55,785 [main] WARN o.a.a.h.i.w.PostWorkflowManager - Could
> not find a monitoring register for job id 9839
> 2018-11-21 18:15:55,785 [main] INFO o.a.a.h.i.w.PostWorkflowManager -
> Status of processing 9839 : false
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)