[
https://issues.apache.org/jira/browse/HDDS-3354?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17112873#comment-17112873
]
Marton Elek commented on HDDS-3354:
-----------------------------------
bq. If there are no further objection
I have no further objection, but I have a comment on your answer. Take it as a
friendly chat during the coffee break about interesting questions related to
Distributed Systems.
bq. if any of the steps fails like checkpoint succeeded, but snapshot file
writes failed, then when Om restart
That's an interesting question. If I understood correctly, your objection is
that if we take the ratis log snapshot and the rocksdb snapshot (=checkpoint)
at the same time, they can become inconsistent with each other if any error
occurs.
I don't think it's a problem. Writing the ratis log snapshot can fail even now,
and that failure already has to be handled. The only question is whether we can
finalize both snapshots in one step, which should be possible: for example,
write the ratis log snapshot file and the rocksdb snapshot file to the same
directory, then move that directory to the final location.
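A minimal sketch of that idea, not the OM implementation: both files are
written to a staging directory, and a single atomic rename publishes them
together. The class name, file names and directory layout are hypothetical,
and fsync/durability handling is left out on purpose.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

public class SnapshotFinalizer {

    // Hypothetical file names; not the names OM or Ratis actually use.
    static final String RATIS_FILE = "ratis.snapshot";
    static final String DB_FILE = "db.checkpoint";

    /**
     * Writes both snapshot files into a staging directory under baseDir and
     * publishes them with one rename. The final directory either contains
     * both files or does not exist at all.
     */
    static Path finalizeSnapshot(Path baseDir, long index,
                                 byte[] ratisSnapshot, byte[] dbCheckpoint)
            throws IOException {
        // Stage next to the final location so the rename stays on one file system.
        Path staging = Files.createTempDirectory(baseDir, "snapshot-" + index + "-tmp");
        Files.write(staging.resolve(RATIS_FILE), ratisSnapshot);
        Files.write(staging.resolve(DB_FILE), dbCheckpoint);

        // Assumes no directory for this index exists yet; with ATOMIC_MOVE the
        // behavior for an existing target is implementation specific.
        Path finalDir = baseDir.resolve("snapshot-" + index);
        Files.move(staging, finalDir, StandardCopyOption.ATOMIC_MOVE);
        return finalDir;
    }

    public static void main(String[] args) throws IOException {
        Path base = Files.createTempDirectory("om-snapshots");
        Path published = finalizeSnapshot(base, 100L,
                "ratis log snapshot".getBytes(StandardCharsets.UTF_8),
                "rocksdb checkpoint".getBytes(StandardCharsets.UTF_8));
        System.out.println("Published " + published);
    }
}
```

If a write fails before the move, only the staging directory is left behind,
and a restart can delete any leftover "-tmp" directories and keep using the
previous snapshot.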
I wouldn't say it's better, but I think it's possible. (How is your coffee?)
> OM HA replay optimization
> -------------------------
>
> Key: HDDS-3354
> URL: https://issues.apache.org/jira/browse/HDDS-3354
> Project: Hadoop Distributed Data Store
> Issue Type: Improvement
> Reporter: Bharat Viswanadham
> Assignee: Bharat Viswanadham
> Priority: Major
> Attachments: OM HA Replay.pdf, Screen Shot 2020-05-20 at 1.28.48
> PM.png
>
>
> This Jira is to improve the OM HA replay scenario.
> The attached design document discusses the proposal and the issue in
> detail.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)