[
https://issues.apache.org/jira/browse/FLUME-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13449883#comment-13449883
]
Ted Malaska commented on FLUME-1516:
------------------------------------
Cool. I will wait for your call Brock.
Thanks Hari. I will review the code with your comment in mind.
> Write Dual Checkpoints to avoid replays
> ---------------------------------------
>
> Key: FLUME-1516
> URL: https://issues.apache.org/jira/browse/FLUME-1516
> Project: Flume
> Issue Type: Improvement
> Components: Channel
> Affects Versions: v1.3.0
> Reporter: Brock Noland
>
> Per the LFS paper (http://www.cs.berkeley.edu/~brewer/cs262/LFS.pdf) we can
> write two checkpoints to avoid replaying the logs in the case we
> crash/shutdown while writing a checkpoint.
> Section 4:
> "In order to handle a crash during a checkpoint operation there are actually
> two checkpoint regions, and checkpoint operations alternate between them. The
> checkpoint time is in the last block of the checkpoint so if the checkpoint
> fails the time will not be updated. During reboot, the system reads both
> checkpoint regions and uses the one with the most recent time."
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira