Jim Challenger created UIMA-3755:
------------------------------------

             Summary: DUCC RM inccorect reconcilliation of 'fixed' job on bounce
                 Key: UIMA-3755
                 URL: https://issues.apache.org/jira/browse/UIMA-3755
             Project: UIMA
          Issue Type: Bug
          Components: DUCC
            Reporter: Jim Challenger
            Assignee: Jim Challenger
            Priority: Minor


If a node running a process for a job of type 'fixed' crashes RM will 'purge' 
the node forcing the rest of ducc to clear its records of the process.

If you then 'bounce' RM, it will think the 'fixed' job is missing an allocation 
(which it technically is, but 'fixed' is defined such that processes that go 
away are not replaced).

RM needs logic so that while recovering non-preemptable jobs that have any 
allocation, it marks them 'allocation complete', the logic being that no 
allocation would have been given unless it was complete at some time in the 
past.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to