Sergey Shelukhin created HBASE-21767:
----------------------------------------
Summary: findRegionsToForceFlush could be improved
Key: HBASE-21767
URL: https://issues.apache.org/jira/browse/HBASE-21767
Project: HBase
Issue Type: Improvement
Reporter: Sergey Shelukhin
Attachments: image-2019-01-23-16-07-42-166.png
We see Too many WALs message logged a lot, and it seems like it usually fails
to bring the WAL count back down for a long time, sometimes hours. This results
in large steady state WAL volume on some region servers, and slower recovery.
Based on the functionality in HBASE-21626, it should be possible to add an
option to be aggressive in this method, and to determine the minimum set of
regions that will actually bring WAL count close to a limit in one operation.
An example of too-many-WALs log statements reporting WAL count over ~2.5 hours,
with a limit of 79. You can see when the count is coming down; it often doesn't
come anywhere close to the limit so another flush is requested, etc. for a long
time.
!image-2019-01-23-16-07-42-166.png!
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)