Sergey Edunov created GIRAPH-950:
------------------------------------

             Summary: Auto-restart from checkpoint doesn't pick up latest 
checkpoint
                 Key: GIRAPH-950
                 URL: https://issues.apache.org/jira/browse/GIRAPH-950
             Project: Giraph
          Issue Type: Bug
            Reporter: Sergey Edunov


While running different jobs with checkpoints enabled I noticed some issues:
1) The way we pick up latest checkpoint is not correct. Current implementation 
just picks whatever is returned last from FileSystem.list(), which is not 
necessarily the last checkpoint
2) If job restarts from checkpoint it immediately creates another checkpoint. 
3) We need more flexibility in GiraphJobRetryChecker to allow restarts after 
multiple failures. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to