[
https://issues.apache.org/jira/browse/HBASE-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100628#comment-13100628
]
[email protected] commented on HBASE-4007:
------------------------------------------------------
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/1747/
-----------------------------------------------------------
(Updated 2011-09-08 19:34:22.528153)
Review request for hbase, Ted Yu and Michael Stack.
Changes
-------
Implemented Ted's feedback.
Verified on a cluster that when a regionserver (splitlog-worker) dies, the
notification reaches SplitLogManager.
Summary
-------
1/ resubmit all tasks owned by a dead splitlog-worker
2/ prevent accumulation of /hbase/splitlog/RESCAN nodes
This addresses bug HBASE-4007.
https://issues.apache.org/jira/browse/HBASE-4007
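
A minimal sketch of item 1/ above, for readers who want to see the intended
behaviour in isolation. It is not the code from the patch: the class
DeadWorkerResubmitSketch, its Task record and all method names are
hypothetical, and ZooKeeper is replaced by an in-memory map. It only shows
that a resubmit triggered by a worker's death is not subject to the retry
limit that bounds timeout-driven resubmits.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for the manager's task bookkeeping (not HBase code).
class DeadWorkerResubmitSketch {
    /** One log-split task; names and fields are illustrative assumptions. */
    static final class Task {
        final String path;
        String owner;     // worker currently holding the task, null if unassigned
        int resubmits;    // timeout-driven resubmits charged against the limit

        Task(String path) { this.path = path; }
    }

    private final Map<String, Task> tasks = new HashMap<>();
    private final int maxResubmits;

    DeadWorkerResubmitSketch(int maxResubmits) { this.maxResubmits = maxResubmits; }

    void addTask(String path) { tasks.put(path, new Task(path)); }

    void assign(String path, String worker) { tasks.get(path).owner = worker; }

    String ownerOf(String path) { return tasks.get(path).owner; }

    /** Timeout-driven resubmit: refuses once the retry limit is reached. */
    boolean resubmitOnTimeout(String path) {
        Task t = tasks.get(path);
        if (t.resubmits >= maxResubmits) {
            return false;
        }
        t.resubmits++;
        t.owner = null;
        return true;
    }

    /** Forced resubmit: releases every task owned by the dead worker,
     *  regardless of how many resubmits were already used. */
    List<String> handleDeadWorker(String worker) {
        List<String> resubmitted = new ArrayList<>();
        for (Task t : tasks.values()) {
            if (worker.equals(t.owner)) {
                t.owner = null;
                resubmitted.add(t.path);
            }
        }
        return resubmitted;
    }
}
```

With a retry limit of 0, resubmitOnTimeout refuses to release a task, while
handleDeadWorker still returns it to the unassigned pool. That is the stuck
case described in the issue.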
Diffs (updated)
-----
src/main/java/org/apache/hadoop/hbase/master/SplitLogManager.java 54b6d45
src/main/java/org/apache/hadoop/hbase/master/MasterFileSystem.java 9a71fdf
src/main/java/org/apache/hadoop/hbase/zookeeper/ZKSplitLog.java 61e5c65
src/test/java/org/apache/hadoop/hbase/master/TestSplitLogManager.java 9a88855
Diff: https://reviews.apache.org/r/1747/diff
Testing
-------
1/ resubmit all tasks owned by a dead splitlog-worker - only unit tested. Will
do cluster testing.
2/ prevent accumulation of /hbase/splitlog/RESCAN nodes - tested and deployed
in production.
Thanks,
Prakash
> distributed log splitting can get indefinitely stuck
> ----------------------------------------------------
>
> Key: HBASE-4007
> URL: https://issues.apache.org/jira/browse/HBASE-4007
> Project: HBase
> Issue Type: Bug
> Reporter: Prakash Khemani
> Assignee: Prakash Khemani
> Priority: Critical
> Fix For: 0.92.0
>
> Attachments:
> 0001-HBASE-4007-distributed-log-splitting-can-get-indefin.patch
>
>
> After the configured number of retries, SplitLogManager will not resubmit
> log-split tasks. In this situation, even if the splitLogWorker that owns
> the task dies, the task will not get resubmitted.
> When a regionserver goes away, all the split-log tasks that it owned
> should be resubmitted by the SplitLogManager.