Burn Lewis created UIMA-4321:
--------------------------------

             Summary: DUCC should nt retry JPs forever when
                 Key: UIMA-4321
                 URL: https://issues.apache.org/jira/browse/UIMA-4321
             Project: UIMA
          Issue Type: Bug
          Components: DUCC
    Affects Versions: 2.0.0-Ducc
            Reporter: Burn Lewis
            Assignee: Jerry Cwiklik
            Priority: Blocker


Job 235131 had a large string of JPs fail (when the JD OOM'd) with:
HttpWorkerThread.run() I/O exception 
(org.apache.commons.httpclient.NoHttpResponseException) caught when processing 
request: The server 192.168.3.77 failed to respond

For the short-term we should count this as a Croak (i.e. an unexpected 
termination that DUCC didn't request), even though it is not caused by user 
error, so that the users's process_failures_limit can eventually end the job.
Perhaps we need a "framework_failures_limit" in ducc.properties for errors 
caught in the ducc-side JP code as opposed to errors caught in user code.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to