[
https://issues.apache.org/jira/browse/HBASE-28463?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Work on HBASE-28463 started by Janardhan Hungund.
-------------------------------------------------
> Time Based Priority for BucketCache
> -----------------------------------
>
> Key: HBASE-28463
> URL: https://issues.apache.org/jira/browse/HBASE-28463
> Project: HBase
> Issue Type: New Feature
> Components: BucketCache
> Reporter: Janardhan Hungund
> Assignee: Janardhan Hungund
> Priority: Major
>
> This Jira introduces the feature of time-based data tiering in HBase to
> optimize storage efficiency and access performance by segregating data based
> on its recency. By keeping recent data in the bucket cache (backed by faster
> storage types like SSDs) and evicting older data, the system aims to provide
> a more flexible control over the cache allocation and eviction logic via
> configuration, allowing for defining time priorities for cached data.
> The need for a more extensive cache allocation mechanism becomes even more
> critical on HBase deployments where cache access reflects on significant
> performance gains, such as when using cloud storage as the underlying file
> system.
> The data is segregated into hot or cold categories based on its age. The
> recent data within a specific time range (configured as hot-data-age) is
> treated as hot and is stored in the cache, while the older data is stored and
> accessed from the file system.
> This feature intends to provide the TCO gains by optimizing the utilization
> of high cost bucket cache. Perfect fit for the use cases that have the
> date-based data writes while the scans focus on the recently written data.
> Please find the detailed design document of the feature attached with the
> Jira.
> Thanks,
> Janardhan
--
This message was sent by Atlassian Jira
(v8.20.10#820010)