[ https://issues.apache.org/jira/browse/HDDS-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17112873#comment-17112873 ]
Marton Elek commented on HDDS-3354: ----------------------------------- bq. If there are no further objection I have no further objection, but have a comment to your answer. Take it as a friendly chat during the coffee break about interesting questions related to Distributed Systems. bq. if any of the steps fails like checkpoint succeeded, but snapshot file writes failed, then when Om restart That's an interesting question. If I understood well, your objections is that if we do ratis log snapshot and rocksdb snapshot (=checkpoint) at the same time they can be inconsistent with each other in case of any error. I don't think it's a problem. Writing ratis log snapshot can fail even now which should be handled. The only question if we can finalize both the snapshots in one step which should be possible: For example write ratis log snapshot file and rocksdb snapshot file to the same directory and move it to the final location. I wouldn't like to say it's better. But I think it's possible (How is your coffee?) > OM HA replay optimization > ------------------------- > > Key: HDDS-3354 > URL: https://issues.apache.org/jira/browse/HDDS-3354 > Project: Hadoop Distributed Data Store > Issue Type: Improvement > Reporter: Bharat Viswanadham > Assignee: Bharat Viswanadham > Priority: Major > Attachments: OM HA Replay.pdf, Screen Shot 2020-05-20 at 1.28.48 > PM.png > > > This Jira is to improve the OM HA replay scenario. > Attached the design document which discusses about the proposal and issue in > detail. -- This message was sent by Atlassian Jira (v8.3.4#803005) --------------------------------------------------------------------- To unsubscribe, e-mail: ozone-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: ozone-issues-h...@hadoop.apache.org