Edward J. Yoon created HAMA-636:
-----------------------------------

             Summary: Confined recovery
                 Key: HAMA-636
                 URL: https://issues.apache.org/jira/browse/HAMA-636
             Project: Hama
          Issue Type: New Feature
          Components: bsp core, messaging
            Reporter: Edward J. Yoon


"Confined recovery" mentioned in Pregel paper can be used to improve the cost 
and latency of recovery. 

In addition to the existing HDFS checkpoints,1) the tasks log outgoing messages 
to local filesystem for each superstep (See disk queue). When a task fails, 2) 
it reverts to the last checkpoint. 3) Other workers re-send messages sent to 
failed worker at each superstep occurring after the last checkpoint.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to