Xiaolin Ha created HBASE-25720:
----------------------------------
Summary: Sync WAL stuck when prepare flush cache will prevent
flush cache and cause OOM
Key: HBASE-25720
URL: https://issues.apache.org/jira/browse/HBASE-25720
Project: HBase
Issue Type: Improvement
Affects Versions: 1.4.13
Reporter: Xiaolin Ha
Assignee: Xiaolin Ha
Attachments: prepare-flush-cache-stuck.png
We call HRegion#doSyncOfUnflushedWALChanges when preparing to flush cache. But
this WAL sync may stuck, and abort the flush of cache.
!prepare-flush-cache-stuck.png|width=519,height=246!
If we cannot aware of this problem in time, RS will OOM kill.
I think we should force abort RS when sync stuck in preparing, like in
committing snapshots.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)