[
https://issues.apache.org/jira/browse/HBASE-4007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13100085#comment-13100085
]
Prakash Khemani commented on HBASE-4007:
----------------------------------------
Hi Stack, I have not pushed this out to production yet ... and the way things
are it will be a while before we do the next push to the hbase-90 tiers.
I will try to get some cluster testing done and will update this thread.
Regarding the use of ConcurrentHashMap as opposed to HashSet + ObjectLock : I
could not find any nice way to take a snapshot of a concurrent-hash-map. The
way the code is written I need to take a snapshot of the deadWorkers set.
I have just rebased. I will try to put it up in the reviewboard one more time.
Thanks,
Prakash
> distributed log splitting can get indefinitely stuck
> ----------------------------------------------------
>
> Key: HBASE-4007
> URL: https://issues.apache.org/jira/browse/HBASE-4007
> Project: HBase
> Issue Type: Bug
> Reporter: Prakash Khemani
> Assignee: Prakash Khemani
> Priority: Critical
> Fix For: 0.92.0
>
> Attachments:
> 0001-HBASE-4007-distributed-log-splitting-can-get-indefin.patch
>
>
> After the configured number of retries SplitLogManager is not going to
> resubmit log-split tasks. In this situation even if the splitLogWorker that
> owns the task dies the task will not get resubmitted.
> When a regionserver goes away then all the split-log tasks that it owned
> should be resubmitted by the SplitLogMaster.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira