Xiaolin Ha created HBASE-25720:
----------------------------------

             Summary: Sync WAL stuck when prepare flush cache will prevent 
flush cache and cause OOM
                 Key: HBASE-25720
                 URL: https://issues.apache.org/jira/browse/HBASE-25720
             Project: HBase
          Issue Type: Improvement
    Affects Versions: 1.4.13
            Reporter: Xiaolin Ha
            Assignee: Xiaolin Ha
         Attachments: prepare-flush-cache-stuck.png

We call HRegion#doSyncOfUnflushedWALChanges when preparing to flush cache. But 
this WAL sync may stuck, and abort the flush of cache. 

!prepare-flush-cache-stuck.png|width=519,height=246!

If we cannot aware of this problem in time, RS will OOM kill.

I think we should force abort RS when sync stuck in preparing, like in 
committing snapshots.

 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to