[ https://issues.apache.org/jira/browse/HBASE-2223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12833161#action_12833161 ]
Jean-Daniel Cryans commented on HBASE-2223:
-------------------------------------------

That's my thinking too, so the first version of this tool will probably just help the HBase administrator make the right choices.

> Handle 10min+ network partitions between clusters
> -------------------------------------------------
>
>                 Key: HBASE-2223
>                 URL: https://issues.apache.org/jira/browse/HBASE-2223
>             Project: Hadoop HBase
>          Issue Type: Sub-task
>            Reporter: Jean-Daniel Cryans
>            Assignee: Jean-Daniel Cryans
>             Fix For: 0.21.0
>
>
> We need a nice way of handling long network partitions without impacting a
> master cluster (which pushes the data). Currently it will just retry over and
> over again.
> I think we could:
>  - Stop replication to a slave cluster if it didn't respond for more than 10
> minutes
>  - Keep track of the duration of the partition
>  - When the slave cluster comes back, initiate a MR job like HBASE-2221
> Maybe we want less than 10 minutes, maybe we want this to be all automatic or
> just the first 2 parts. Discuss.
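
The three steps proposed in the issue description (stop after a timeout, track the partition's duration, catch up when the slave returns) could be sketched roughly as the small state machine below. This is only an illustration, not HBase's replication code: the class PartitionTracker, its Action enum, and all method names are hypothetical, and the caller is assumed to pass in the current time in milliseconds and to react to the returned action itself (for example by launching a catch-up MR job along the lines of HBASE-2221).

```java
/**
 * Hypothetical sketch of the partition handling proposed in the issue.
 * Nothing here is an HBase API; every name is made up for illustration.
 */
final class PartitionTracker {

    /** What the master cluster's replication source should do next. */
    enum Action { SHIP, RETRY, SUSPEND, STAY_SUSPENDED, CATCH_UP }

    private final long timeoutMillis;     // e.g. 10 minutes, per the issue
    private Long firstFailureMillis;      // null while the slave is reachable
    private boolean suspended;
    private long lastPartitionMillis = -1; // -1 until an outage has ended

    PartitionTracker(long timeoutMillis) {
        this.timeoutMillis = timeoutMillis;
    }

    /** Call when a replication attempt to the slave cluster fails. */
    Action onFailure(long nowMillis) {
        if (firstFailureMillis == null) {
            firstFailureMillis = nowMillis; // the outage starts here
        }
        if (suspended) {
            return Action.STAY_SUSPENDED;
        }
        // "more than 10 minutes": strictly greater than the timeout
        if (nowMillis - firstFailureMillis > timeoutMillis) {
            suspended = true;
            return Action.SUSPEND;
        }
        return Action.RETRY;
    }

    /** Call when the slave cluster responds again. */
    Action onSuccess(long nowMillis) {
        boolean wasSuspended = suspended;
        if (firstFailureMillis != null) {
            // Record how long the outage lasted, even for short ones.
            lastPartitionMillis = nowMillis - firstFailureMillis;
        }
        firstFailureMillis = null;
        suspended = false;
        // After a suspension the missed edits need a bulk catch-up;
        // otherwise normal shipping simply resumes.
        return wasSuspended ? Action.CATCH_UP : Action.SHIP;
    }

    /** Duration of the most recently ended outage, or -1 if none yet. */
    long lastPartitionMillis() {
        return lastPartitionMillis;
    }
}
```

Keeping the decision logic free of timers and I/O means the timeout stays a plain constructor argument, which leaves open the question raised above of whether 10 minutes is the right threshold and whether the CATCH_UP step should be automatic or left to the administrator.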