[ https://issues.apache.org/jira/browse/HBASE-17938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15991857#comment-15991857 ]
Vladimir Rodionov commented on HBASE-17938: ------------------------------------------- {quote} Assuming IOE may come out of each of the calls above, shouldn't a state machine be designed for more robustness ? {quote} No needs to overcomplicate the feature. If system repair fails in the middle, user will get notified and will have a chance to fix everything by running repair tool manually. Repair operation is idempotent and can be run as many times as we need. Did I address your concerns, [~tedyu]? > General fault - tolerance framework for backup/restore operations > ----------------------------------------------------------------- > > Key: HBASE-17938 > URL: https://issues.apache.org/jira/browse/HBASE-17938 > Project: HBase > Issue Type: Sub-task > Reporter: Vladimir Rodionov > Assignee: Vladimir Rodionov > Fix For: 2.0.0 > > Attachments: HBASE-17938-v1.patch, HBASE-17938-v2.patch, > HBASE-17938-v3.patch > > > The framework must take care of all general types of failures during backup/ > restore and restore system to the original state in case of a failure. > That won't solve all the possible issues but we have a separate JIRAs for > them as a sub-tasks of HBASE-15277 -- This message was sent by Atlassian JIRA (v6.3.15#6346)