[ https://issues.apache.org/jira/browse/HBASE-16960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15614193#comment-15614193 ]
binlijin commented on HBASE-16960:
----------------------------------
The problem is that MemStoreFlusher.1 calls FSHLog.sync and never gets a
result, so the call does not return.
{code}
"MemStoreFlusher.1" prio=10 tid=0x00007f553e0dc800 nid=0x27c91 in Object.wait() [0x00007f5519d73000]
   java.lang.Thread.State: TIMED_WAITING (on object monitor)
    at java.lang.Object.wait(Native Method)
    at org.apache.hadoop.hbase.regionserver.wal.SyncFuture.get(SyncFuture.java:167)
    - locked <0x00007f593d16e128> (a org.apache.hadoop.hbase.regionserver.wal.SyncFuture)
    at org.apache.hadoop.hbase.regionserver.wal.FSHLog.blockOnSync(FSHLog.java:1523)
    at org.apache.hadoop.hbase.regionserver.wal.FSHLog.publishSyncThenBlockOnCompletion(FSHLog.java:1517)
    at org.apache.hadoop.hbase.regionserver.wal.FSHLog.sync(FSHLog.java:1607)
    at org.apache.hadoop.hbase.regionserver.HRegion.internalPrepareFlushCache(HRegion.java:2289)
    at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2110)
    at org.apache.hadoop.hbase.regionserver.HRegion.internalFlushcache(HRegion.java:2075)
    at org.apache.hadoop.hbase.regionserver.HRegion.flushcache(HRegion.java:1967)
    at org.apache.hadoop.hbase.regionserver.HRegion.flush(HRegion.java:1893)
    at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:510)
    at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.flushRegion(MemStoreFlusher.java:471)
    at org.apache.hadoop.hbase.regionserver.MemStoreFlusher.access$800(MemStoreFlusher.java:75)
    at org.apache.hadoop.hbase.regionserver.MemStoreFlusher$FlushHandler.run(MemStoreFlusher.java:259)
    at java.lang.Thread.run(Thread.java:756)
{code}
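To illustrate why the thread shows up as TIMED_WAITING and yet never returns, here is a simplified sketch of the pattern. This is not the actual FSHLog or SyncFuture code: the class SyncWaitSketch and its nested SimpleSyncFuture are hypothetical names, and the one-second wait interval is an assumption. The waiter wakes up periodically, sees that nobody has marked the future done, and goes back to waiting. If the thread that is supposed to complete the sync has already gone away, the loop never exits.

```java
public class SyncWaitSketch {

    /** Hypothetical stand-in for a sync future; NOT the HBase SyncFuture. */
    static final class SimpleSyncFuture {
        private boolean done = false;

        /** Called by whichever thread actually performs the sync. */
        synchronized void markDone() {
            done = true;
            notifyAll();
        }

        /** Blocks until markDone() is called; each timeout just re-checks the flag. */
        synchronized void get() throws InterruptedException {
            while (!done) {
                wait(1000); // assumed interval; the thread is TIMED_WAITING here
            }
        }
    }

    public static void main(String[] args) throws Exception {
        SimpleSyncFuture future = new SimpleSyncFuture();

        // Plays the role of the flusher thread that calls sync and blocks.
        Thread flusher = new Thread(() -> {
            try {
                future.get();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        }, "flusher-sketch");
        flusher.setDaemon(true);
        flusher.start();

        Thread.sleep(300);
        System.out.println("state while nobody completes the sync: " + flusher.getState());

        // Only an explicit completion releases the waiter.
        future.markDone();
        flusher.join(2000);
        System.out.println("alive after markDone: " + flusher.isAlive());
    }
}
```

In the sketch the waiter is released only because main calls markDone(). Without that call the flusher-sketch thread would stay in the wait loop indefinitely, which matches the symptom in the jstack output above.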
> RegionServer hang when aborting
> -------------------------------
>
> Key: HBASE-16960
> URL: https://issues.apache.org/jira/browse/HBASE-16960
> Project: HBase
> Issue Type: Bug
> Reporter: binlijin
> Attachments: rs1081.jstack
>
>
> We have seen the regionserver hang while aborting several times. This takes
> all regions on this regionserver out of service, and all affected
> applications then stop working.