Emil Kleszcz created HBASE-29904:
------------------------------------

             Summary: Tombstone-heavy regions are not prioritized for major 
compaction, causing severe scan latency after deletes
                 Key: HBASE-29904
                 URL: https://issues.apache.org/jira/browse/HBASE-29904
             Project: HBase
          Issue Type: Improvement
    Affects Versions: 2.5.10
            Reporter: Emil Kleszcz


After large delete workloads, some regions accumulate very high numbers of 
tombstones. Until a major compaction runs, these regions suffer severe scan 
latency degradation from milliseconds to tens of seconds or even minutes.

HBase currently:
* does not expose tombstone density per region/store, and
* does not prioritize major compactions based on tombstone impact.

As a result, periodic major compactions may miss the worst regions for days or 
weeks, even though user-facing queries are already severely affected.

Observed behavior:
* Before deletes or after major compaction: ~50–200 ms
* After deletes, before major compaction: 10–120 seconds
* After manual major compaction on the affected region: back to milliseconds

The slowdown is caused by scanning and filtering large volumes of delete 
markers (tombstones), not by storefile count or region size. Minor compactions 
do not resolve the issue. The issue is not present when _RAW_ option is set to 
_true_.

*Current limitations*
* No metric to identify tombstone-heavy regions.
* Compaction selection is driven by file count / size, or last compaction time, 
not delete density.
* Operators must manually correlate slow queries with regions and trigger 
targeted major compactions.
* This risks both over-scheduling majors and missing the worst regions.

*Requested improvement*
* Expose tombstone-related metrics (e.g. delete marker count or ratio) at 
region/store level.
* Allow compaction scheduling to prioritize regions with high tombstone density 
or reclaimable space.
* Provide controls to rate-limit and restrict such tombstone-driven major 
compactions (e.g. off-peak only).

*Expected benefit*
* Predictable scan latency after deletes.
* Faster reclamation of deleted data.
* Reduced need for operator-driven major compactions prone to over-scheduling.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to