[ 
https://issues.apache.org/jira/browse/HBASE-16960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15614234#comment-15614234
 ] 

binlijin commented on HBASE-16960:
----------------------------------

The problem can happen when:
(1)FSHLog#rollWriter throw exception and LogRoller call regionserver.abort 
(2)RingBufferEventHandler.onEvent process FSWALEntry throw DamagedWALException 
(3)RingBufferEventHandler.onEvent process safe point, set 
RingBufferEventHandler.exception=null
(4)RingBufferEventHandler.onEvent process SyncFuture (MemStoreFlusher.1  
FSHLog.sync)   endOfBatch=false
(5)RingBufferEventHandler.onEvent process FSWALEntry (ASYNC_WAL  FSHLog.append)
There is no other events, so the MemStoreFlusher.1  FSHLog.sync will hang.

> RegionServer hang when aborting
> -------------------------------
>
>                 Key: HBASE-16960
>                 URL: https://issues.apache.org/jira/browse/HBASE-16960
>             Project: HBase
>          Issue Type: Bug
>            Reporter: binlijin
>         Attachments: RingBufferEventHandler.png, 
> RingBufferEventHandler_exception.png, SyncFuture.png, 
> SyncFuture_exception.png, rs1081.jstack
>
>
> We see regionserver hang when aborting several times and cause all regions on 
> this regionserver out of service and then all affected applications stop 
> works.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to