[
https://issues.apache.org/jira/browse/HBASE-25720?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17377532#comment-17377532
]
Michael Stack commented on HBASE-25720:
---------------------------------------
Is there anything in the log from before the point your png captures, something
that shows how or why the WAL system got stuck? Do you have a jstack? Thanks [~Xiaolin Ha]
> Sync WAL stuck when prepare flush cache will prevent flush cache and cause OOM
> ------------------------------------------------------------------------------
>
> Key: HBASE-25720
> URL: https://issues.apache.org/jira/browse/HBASE-25720
> Project: HBase
> Issue Type: Improvement
> Affects Versions: 1.4.13
> Reporter: Xiaolin Ha
> Assignee: Xiaolin Ha
> Priority: Major
> Attachments: prepare-flush-cache-stuck.png
>
>
> We call HRegion#doSyncOfUnflushedWALChanges when preparing to flush the cache.
> But this WAL sync may get stuck and abort the cache flush.
> !prepare-flush-cache-stuck.png|width=519,height=246!
> If we do not become aware of this problem in time, the RS will be OOM-killed.
> I think we should force-abort the RS when the sync gets stuck during
> preparation, as is done when committing snapshots.
>
>
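
For illustration only, the proposal in the description (abort the RS if the WAL sync stalls while preparing the flush) could be sketched roughly as below. This is not HBase code: the class PrepareFlushSyncSketch, the Abortable interface, the syncOrAbort method and the timeout value are all hypothetical names made up for this sketch, and a plain java.util.concurrent.Future stands in for the pending WAL sync.

```java
import java.util.concurrent.CompletableFuture;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;
import java.util.concurrent.TimeoutException;

public class PrepareFlushSyncSketch {

    /** Hypothetical stand-in for the region server's abort hook. */
    public interface Abortable {
        void abort(String why, Throwable cause);
    }

    /**
     * Waits at most timeoutMs for a pending sync (assumed to be exposed as a Future).
     * Returns true if the sync completed. On timeout or failure it asks the
     * server to abort and returns false, instead of blocking indefinitely.
     */
    public static boolean syncOrAbort(Future<?> pendingSync, long timeoutMs, Abortable server)
            throws InterruptedException {
        try {
            pendingSync.get(timeoutMs, TimeUnit.MILLISECONDS);
            return true;
        } catch (TimeoutException | ExecutionException e) {
            server.abort("WAL sync did not complete while preparing flush", e);
            return false;
        }
    }

    public static void main(String[] args) throws Exception {
        Abortable server = (why, cause) -> System.out.println("ABORT: " + why);

        // A sync that has already completed: no abort is requested.
        System.out.println(syncOrAbort(CompletableFuture.completedFuture(null), 100, server));

        // A sync that never completes: the wait times out and abort is requested.
        System.out.println(syncOrAbort(new CompletableFuture<Void>(), 100, server));
    }
}
```

The point of the sketch is only the bounded wait: a stuck sync surfaces as an explicit abort rather than as memstore growth that ends in OOM. How the real flush path would obtain such a future, and what timeout is appropriate, are open questions here.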