[ https://issues.apache.org/jira/browse/HBASE-5844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13463591#comment-13463591 ]
nkeywal commented on HBASE-5844: -------------------------------- Humm. The feature is important imho: waiting 30s (at best) before starting a recovery is really nice. In a ideal world, ZooKeeper would make this less useful by detecting the dead process sooner, but still it can't be faster than this. Note that znode remover should occur when the process finishes, not before starting a new one. What JD describes seems a bug to me. > Delete the region servers znode after a regions server crash > ------------------------------------------------------------ > > Key: HBASE-5844 > URL: https://issues.apache.org/jira/browse/HBASE-5844 > Project: HBase > Issue Type: Improvement > Components: regionserver, scripts > Affects Versions: 0.96.0 > Reporter: nkeywal > Assignee: nkeywal > Fix For: 0.96.0 > > Attachments: 5844.v1.patch, 5844.v2.patch, 5844.v3.patch, > 5844.v3.patch, 5844.v4.patch > > > today, if the regions server crashes, its znode is not deleted in ZooKeeper. > So the recovery process will stop only after a timeout, usually 30s. > By deleting the znode in start script, we remove this delay and the recovery > starts immediately. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira