[ 
https://issues.apache.org/jira/browse/HBASE-20312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16422205#comment-16422205
 ] 

Yu Li commented on HBASE-20312:
-------------------------------

bq. In a multi-tenant cluster, there may be table(s) with cell size much larger 
than the 50-byte key-value mentioned in the description. Is CCSMap good for 
larger cell sizes as well?
We will use HugeOnHeapChunk for cells larger than 4MB, which also uses the 
compacted structure (please check the code for more details). In theory there 
should be no regression with large cells compared to the existing MemStores.
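
As a rough illustration of the size-based choice described above (a sketch, not 
the code in the patch): only the HugeOnHeapChunk name and the 4MB threshold come 
from this comment. The class, enum, and method names below are hypothetical, and 
the real allocation path may look quite different.

```java
public class ChunkChoiceSketch {
    // Threshold taken from the comment: cells larger than 4MB get a huge chunk.
    static final long HUGE_CELL_THRESHOLD = 4L * 1024 * 1024;

    // Hypothetical labels for the two allocation paths.
    enum ChunkKind { POOLED_CHUNK, HUGE_ON_HEAP_CHUNK }

    // Picks the allocation path for a cell of the given serialized size.
    static ChunkKind chooseChunk(long cellSizeBytes) {
        if (cellSizeBytes < 0) {
            throw new IllegalArgumentException("cell size must be non-negative");
        }
        return cellSizeBytes > HUGE_CELL_THRESHOLD
            ? ChunkKind.HUGE_ON_HEAP_CHUNK
            : ChunkKind.POOLED_CHUNK;
    }

    public static void main(String[] args) {
        System.out.println(chooseChunk(50));                // small cell
        System.out.println(chooseChunk(8L * 1024 * 1024));  // 8MB cell
    }
}
```

Either way the cell is stored in the same compacted layout; only the backing 
chunk differs.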

We did discuss the need for a mixed configuration during the design, but it 
would be quite complicated for operators to use. Both CCSMapMemStore and 
MemStores with an MSLAB pool have their own chunk pool, so how would operators 
configure the size of both (or multiple) pools? Should we allow the pools to 
steal from each other? And what benefit would we gain from a mixed 
configuration compared to the effort required? After these considerations our 
decision was not to support it, although of course people may have different 
ideas here.

And I believe there will be more questions, such as whether we need to add 
some kind of CompactingMemStore based on CCSMap. Personally I'd like to regard 
all of these as possible future work (and would be happy to see more people get 
involved in the development), while here I suggest we focus on introducing the 
new and creative CCSMap and a basic MemStore built on it.

Rome was not built in a day, so let's start the work with a good base (smile).

> CCSMap: A faster, GC-friendly, less memory Concurrent Map for memstore
> ----------------------------------------------------------------------
>
>                 Key: HBASE-20312
>                 URL: https://issues.apache.org/jira/browse/HBASE-20312
>             Project: HBase
>          Issue Type: New Feature
>          Components: regionserver
>            Reporter: Xiang Wang
>            Assignee: Chance Li
>            Priority: Minor
>             Fix For: 3.0.0
>
>         Attachments: HBASE-20312-1.3.2.patch, HBASE-20312-master.v1.patch, 
> ccsmap-branch-1.1.patch, jira1.png, jira2.png, jira3.png
>
>
> Currently HBase uses ConcurrentSkipListMap as the memstore's data structure.
> Although MSLAB reduces the memory fragmentation caused by key-value pairs,
> hundreds of millions of key-value pairs still make young-generation 
> garbage collection (GC) pause times long.
>  
> ConcurrentSkipListMap has 2 GC problems:
> 1. HBase needs 3 objects on average to store one key-value: one Index (the 
> skiplist's average node height is 1), one Node, and one KeyValue. Too many 
> objects are created for the memstore.
> 2. Recently inserted KeyValues and their map structures (Index, Node) are 
> allocated in the young generation. The card table (for the CMS GC algorithm) 
> or RSet (for the G1 GC algorithm) changes frequently under high write 
> throughput, which makes YGC slow.
>  
> We developed a new skip-list map called CompactedConcurrentSkipListMap (CCSMap 
> for short),
> which provides features similar to ConcurrentSkipListMap but gets rid of the 
> per-key-value objects.
> CCSMap's memory structure is shown in this picture:
> !jira1.png!
>  
> One CCSMap consists of a certain number of Chunks. One Chunk consists of a 
> certain number of nodes. One node corresponds to one element. All of the 
> element's information, including its key-value, is encoded in a contiguous 
> memory segment without any objects.
> Features:
> 1. All insert, update, and delete operations on CCSMap are lock-free.
> 2. Consumes less memory: it brings a 40% memory saving for 50-byte key-values.
> 3. Faster on small key-values because of better cache-line usage: 20~30% better 
> read/write throughput than ConcurrentSkipListMap for 50-byte key-values.
> CCSMap does not support reclaiming space when an element is deleted, but this 
> doesn't matter for HBase because of region flushes.
> CCSMap has been running on Alibaba's HBase clusters for over 17 months, and it 
> cuts down YGC time significantly. Here are 2 graphs, before and after:
> !jira2.png!
> !jira3.png!
>  
>  
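
To make the "one node in a contiguous memory segment, no per-element objects" 
idea from the quoted description more concrete, here is a minimal sketch. The 
layout used here ([int keyLen][int valueLen][key bytes][value bytes]) and all 
class and method names are assumptions for illustration only. The real CCSMap 
node layout is the one shown in jira1.png and in the attached patches, and it 
also has to carry the skip-list links, which this sketch leaves out.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class NodeEncodingSketch {
    // Hypothetical header: two ints holding the key length and the value length.
    static final int HEADER_BYTES = 2 * Integer.BYTES;

    // Appends one node at the chunk's current position and returns its offset.
    // The caller keeps only the int offset, not an object per element.
    static int putNode(ByteBuffer chunk, byte[] key, byte[] value) {
        int offset = chunk.position();
        chunk.putInt(key.length).putInt(value.length).put(key).put(value);
        return offset;
    }

    // Copies the key of the node stored at the given offset.
    static byte[] readKey(ByteBuffer chunk, int offset) {
        byte[] key = new byte[chunk.getInt(offset)];
        ByteBuffer view = chunk.duplicate(); // independent position, shared bytes
        view.position(offset + HEADER_BYTES);
        view.get(key);
        return key;
    }

    // Copies the value of the node stored at the given offset.
    static byte[] readValue(ByteBuffer chunk, int offset) {
        int keyLen = chunk.getInt(offset);
        byte[] value = new byte[chunk.getInt(offset + Integer.BYTES)];
        ByteBuffer view = chunk.duplicate();
        view.position(offset + HEADER_BYTES + keyLen);
        view.get(value);
        return value;
    }

    public static void main(String[] args) {
        ByteBuffer chunk = ByteBuffer.allocate(1024); // stand-in for one Chunk
        int node = putNode(chunk,
            "row1".getBytes(StandardCharsets.UTF_8),
            "v1".getBytes(StandardCharsets.UTF_8));
        System.out.println(new String(readKey(chunk, node), StandardCharsets.UTF_8));
    }
}
```

Because a node is addressed by an offset into a chunk rather than by a Java 
reference, adding an element creates no new objects for the collector to track, 
which is the property the description relies on.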



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
