[
https://issues.apache.org/jira/browse/HBASE-8321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13632019#comment-13632019
]
Jimmy Xiang commented on HBASE-8321:
------------------------------------
The heartbeat stops if the worker doesn't hold the task any more, or the task
is completed. So, of course, you won't get the heart beats if the worker is
dead.
As to re-assign to a different worker in case one is really slow, yes, this
patch doesn't support it any more. The idea is that it doesn't help to give
the task to others if the file is under recovery, and it indeed takes time.
> Log split worker should heartbeat to avoid timeout
> --------------------------------------------------
>
> Key: HBASE-8321
> URL: https://issues.apache.org/jira/browse/HBASE-8321
> Project: HBase
> Issue Type: Bug
> Components: wal
> Reporter: Jimmy Xiang
> Assignee: Jimmy Xiang
> Attachments: trunk-8321_v1.patch
>
>
> Currently, hlog splitter could spend quite sometime to split a log in case
> any HDFS issue and recoverLease/retry opening is needed. If distributed log
> split manager times out the log worker, other log worker to take over will
> run into the same issue.
> Ideally, we should not need a timeout monitor. Since we have a timeout
> monitor for DSL now, the worker should heartbeat to avoid wrong/unneeded
> timeouts.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira