[
https://issues.apache.org/jira/browse/HAMA-411?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062147#comment-13062147
]
Thomas Jungblut edited comment on HAMA-411 at 7/8/11 8:03 PM:
--------------------------------------------------------------
I overthought the logic with overriding the default implementation.
I don't think this gonna work, let's assume the user is going to set the
checkpoint to true in every thirds superstep.
And now a task failed (two supersteps after checkpointing) and we don't
acutally have the state safed to revert onto the calculation it was 3 steps ago.
And turning on and off the checkpointing should be configurable in the
Configuration not via a method.
So scratch all that fancyness I thought of, it isn't going to work.
was (Author: thomas.jungblut):
I overthought the overriding of the logic.
I don't think this gonna work, let's assume the user is going to set the
checkpoint to true in every thirds superstep.
And now a task failed (two supersteps after checkpointing) and we don't
acutally have the state safed to revert onto the calculation it was 3 steps ago.
And turning on and off the checkpointing should be configurable in the
Configuration not via a method.
So scratch all that fancyness I thought of, it isn't going to work.
> Support checkpoint based on HDFS
> --------------------------------
>
> Key: HAMA-411
> URL: https://issues.apache.org/jira/browse/HAMA-411
> Project: Hama
> Issue Type: New Feature
> Components: bsp
> Reporter: Thomas Jungblut
>
> We need to add checkpointing to Hama to deal with fault in future.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira