[
https://issues.apache.org/jira/browse/FLUME-1516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13528634#comment-13528634
]
Hari Shreedharan edited comment on FLUME-1516 at 12/11/12 3:01 AM:
-------------------------------------------------------------------
[~ted.m] Are you still working on this one? I am planning to work on this soon.
I have some ideas for this. I will post a concise design document for this in a
while.
was (Author: hshreedharan):
I am planning to work on this soon. I have some ideas for this. I will post
a concise design document for this in a while.
> Write Dual Checkpoints to avoid replays
> ---------------------------------------
>
> Key: FLUME-1516
> URL: https://issues.apache.org/jira/browse/FLUME-1516
> Project: Flume
> Issue Type: Improvement
> Components: Channel
> Affects Versions: v1.3.0
> Reporter: Brock Noland
>
> Per the LFS paper (http://www.cs.berkeley.edu/~brewer/cs262/LFS.pdf) we can
> write two checkpoints to avoid replaying the logs in the case we
> crash/shutdown while writing a checkpoint.
> Section 4:
> "In order to handle a crash during a checkpoint operation there are actually
> two checkpoint regions, and checkpoint operations alternate between them. The
> checkpoint time is in the last block of the checkpoint so if the checkpoint
> fails the time will not be updated. During reboot, the system reads both
> checkpoint regions and uses the one with the most recent time."
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira