[
https://issues.apache.org/jira/browse/HBASE-8631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jeffrey Zhong updated HBASE-8631:
---------------------------------
Attachment: hbase-8631.patch
This patch includes several changes from my integration testing.
1) pick wal firstly inside splitlogworker
2) decrease retries(fast fail) in distributedLogReplay mode so a splitlogworker
won't be blocked(currently server to server retries are set to 100 by default.
It means 50 mins wait)
3) Add support of removing recovering region when a RS has both meta and user
region situation
> Meta Region First Recovery
> --------------------------
>
> Key: HBASE-8631
> URL: https://issues.apache.org/jira/browse/HBASE-8631
> Project: HBase
> Issue Type: Bug
> Components: MTTR
> Reporter: Jeffrey Zhong
> Assignee: Jeffrey Zhong
> Attachments: hbase-8631.patch
>
>
> We have a separate wal for meta region. While log splitting logic haven't
> taken the advantage of this and splitlogworker still picks a wal file
> randomly. Imaging if we have multiple region servers including meta RS fails
> about the same time while meta wal is recovered last, all failed regions have
> to wait meta recovered and then can be online again.
> The open JIRA is to let splitlogworker to pick a meta wal file firstly and
> then others.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira