[ 
https://issues.apache.org/jira/browse/HBASE-30018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Rodionov updated HBASE-30018:
--------------------------------------
    Description: 
This umbrella tracks work to refactor the HBase block cache into a pluggable 
architecture with clear separation of:
        •       cache storage (engine)
        •       cache topology (L1/L2 orchestration)
        •       placement and admission policy

The current design tightly couples these concerns (BlockCache, 
CombinedBlockCache, BucketCache), making it difficult to introduce new cache 
implementations and evolve cache behavior.

In addition, existing implementations (e.g., BucketCache) incur significant 
metadata overhead at large scale (e.g., ~9GB for ~1.6TB cache with 64KB 
blocks), reducing effective cache capacity.

This effort introduces:
        •       BlockCacheEngine (storage abstraction)
        •       CacheTopology (tier orchestration)
        •       CachePlacementPolicy (admission, placement, promotion)
        •       explicit cache admission control on write/put path
        •       CacheAccessService for read/write path integration

The work will be implemented incrementally:
        •       Phase 1: introduce internal APIs (no behavior change)
        •       Phase 2: refactor existing topology (CombinedBlockCache)
        •       Phase 3: adapt BucketCache to new interfaces
        •       Phase 4: enable new engines (e.g., CarrotCache, EHCache)

This umbrella groups the individual tasks required to deliver this 
functionality.

RFC design doc: 
https://docs.google.com/document/d/1DOoRfqdDzC-Nz9zfCzSed8vbhRe5BMz8/edit?usp=sharing&ouid=107898024699489289958&rtpof=true&sd=true

  was:
This umbrella tracks work to refactor the HBase block cache into a pluggable 
architecture with clear separation of:
        •       cache storage (engine)
        •       cache topology (L1/L2 orchestration)
        •       placement and admission policy

The current design tightly couples these concerns (BlockCache, 
CombinedBlockCache, BucketCache), making it difficult to introduce new cache 
implementations and evolve cache behavior.

In addition, existing implementations (e.g., BucketCache) incur significant 
metadata overhead at large scale (e.g., ~9GB for ~1.6TB cache with 64KB 
blocks), reducing effective cache capacity.

This effort introduces:
        •       BlockCacheEngine (storage abstraction)
        •       CacheTopology (tier orchestration)
        •       CachePlacementPolicy (admission, placement, promotion)
        •       explicit cache admission control on write/put path
        •       CacheAccessService for read/write path integration

The work will be implemented incrementally:
        •       Phase 1: introduce internal APIs (no behavior change)
        •       Phase 2: refactor existing topology (CombinedBlockCache)
        •       Phase 3: adapt BucketCache to new interfaces
        •       Phase 4: enable new engines (e.g., CarrotCache)

This umbrella groups the individual tasks required to deliver this 
functionality.

RFC design doc: 
https://docs.google.com/document/d/1DOoRfqdDzC-Nz9zfCzSed8vbhRe5BMz8/edit?usp=sharing&ouid=107898024699489289958&rtpof=true&sd=true


> Pluggable HBase Block Cache System
> ----------------------------------
>
>                 Key: HBASE-30018
>                 URL: https://issues.apache.org/jira/browse/HBASE-30018
>             Project: HBase
>          Issue Type: Umbrella
>          Components: BlockCache
>            Reporter: Vladimir Rodionov
>            Assignee: Vladimir Rodionov
>            Priority: Major
>
> This umbrella tracks work to refactor the HBase block cache into a pluggable 
> architecture with clear separation of:
>       •       cache storage (engine)
>       •       cache topology (L1/L2 orchestration)
>       •       placement and admission policy
> The current design tightly couples these concerns (BlockCache, 
> CombinedBlockCache, BucketCache), making it difficult to introduce new cache 
> implementations and evolve cache behavior.
> In addition, existing implementations (e.g., BucketCache) incur significant 
> metadata overhead at large scale (e.g., ~9GB for ~1.6TB cache with 64KB 
> blocks), reducing effective cache capacity.
> This effort introduces:
>       •       BlockCacheEngine (storage abstraction)
>       •       CacheTopology (tier orchestration)
>       •       CachePlacementPolicy (admission, placement, promotion)
>       •       explicit cache admission control on write/put path
>       •       CacheAccessService for read/write path integration
> The work will be implemented incrementally:
>       •       Phase 1: introduce internal APIs (no behavior change)
>       •       Phase 2: refactor existing topology (CombinedBlockCache)
>       •       Phase 3: adapt BucketCache to new interfaces
>       •       Phase 4: enable new engines (e.g., CarrotCache, EHCache)
> This umbrella groups the individual tasks required to deliver this 
> functionality.
> RFC design doc: 
> https://docs.google.com/document/d/1DOoRfqdDzC-Nz9zfCzSed8vbhRe5BMz8/edit?usp=sharing&ouid=107898024699489289958&rtpof=true&sd=true



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to