[
https://issues.apache.org/jira/browse/HBASE-16960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15614234#comment-15614234
]
binlijin commented on HBASE-16960:
----------------------------------
The problem can happen when:
(1)FSHLog#rollWriter throw exception and LogRoller call regionserver.abort
(2)RingBufferEventHandler.onEvent process FSWALEntry throw DamagedWALException
(3)RingBufferEventHandler.onEvent process safe point, set
RingBufferEventHandler.exception=null
(4)RingBufferEventHandler.onEvent process SyncFuture (MemStoreFlusher.1
FSHLog.sync) endOfBatch=false
(5)RingBufferEventHandler.onEvent process FSWALEntry (ASYNC_WAL FSHLog.append)
There is no other events, so the MemStoreFlusher.1 FSHLog.sync will hang.
> RegionServer hang when aborting
> -------------------------------
>
> Key: HBASE-16960
> URL: https://issues.apache.org/jira/browse/HBASE-16960
> Project: HBase
> Issue Type: Bug
> Reporter: binlijin
> Attachments: RingBufferEventHandler.png,
> RingBufferEventHandler_exception.png, SyncFuture.png,
> SyncFuture_exception.png, rs1081.jstack
>
>
> We see regionserver hang when aborting several times and cause all regions on
> this regionserver out of service and then all affected applications stop
> works.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)