[ 
https://issues.apache.org/jira/browse/HBASE-7845?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13587031#comment-13587031
 ] 

Liang Xie commented on HBASE-7845:
----------------------------------

Here is a simple test result. The data was loaded with YCSB using:
recordcount=100000000
fieldcount=1
fieldlength=50


Metrics from the original (unpatched) build after loading finished and a major_compact:
numberOfStores=1, numberOfStorefiles=1, storefileUncompressedSizeMB=9847, 
storefileSizeMB=9850, compressionRatio=1.0003, memstoreSizeMB=197, 
storefileIndexSizeMB=0, readRequestsCount=0, writeRequestsCount=28225, 
rootIndexSizeKB=6, totalStaticIndexSizeKB=9356, totalStaticBloomSizeKB=0, 
totalCompactingKVs=99260688, currentCompactedKVs=99260688, 
compactionProgressPct=1.0, coprocessors=[]

Metrics from the patched build after loading finished and a major_compact:
numberOfStores=1, numberOfStorefiles=1, storefileUncompressedSizeMB=9845, 
storefileSizeMB=9848, compressionRatio=1.0003, memstoreSizeMB=197, 
storefileIndexSizeMB=0, readRequestsCount=0, writeRequestsCount=28225, 
rootIndexSizeKB=4, totalStaticIndexSizeKB=7161, totalStaticBloomSizeKB=0, 
totalCompactingKVs=99260688, currentCompactedKVs=99260688, 
compactionProgressPct=1.0, coprocessors=[]


We can see that "totalStaticIndexSizeKB" is reduced by (9356-7161)/9356 = 
about 23.46%. Indeed, in practice the saving will depend mainly on the 
characteristics of the real keys.
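
As a quick check of the arithmetic, here is a small Python sketch. The dict and function names are only for illustration; the values are copied from the two metric dumps above.

```python
# Index sizes (KB) copied from the original and patched metric dumps.
original = {"rootIndexSizeKB": 6, "totalStaticIndexSizeKB": 9356}
patched = {"rootIndexSizeKB": 4, "totalStaticIndexSizeKB": 7161}


def reduction_pct(before, after):
    """Relative reduction from `before` to `after`, in percent."""
    return (before - after) / before * 100.0


for name in original:
    pct = reduction_pct(original[name], patched[name])
    print(f"{name}: {pct:.2f}% smaller")
```

The other metrics (store file size, KV counts) are essentially unchanged between the two runs, so only the index sizes are compared here.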
                
> optimize hfile index key
> ------------------------
>
>                 Key: HBASE-7845
>                 URL: https://issues.apache.org/jira/browse/HBASE-7845
>             Project: HBase
>          Issue Type: Improvement
>          Components: HFile
>    Affects Versions: 0.96.0
>            Reporter: Liang Xie
>            Assignee: Liang Xie
>         Attachments: HBASE-7845.txt
>
>
> LevelDB uses BytewiseComparatorImpl::FindShortestSeparator() and 
> FindShortSuccessor() to reduce index key size; doing the same in HFile 
> would be helpful under special conditions.
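
For readers unfamiliar with the LevelDB technique mentioned in the description, the idea can be sketched in Python roughly as follows. This is a paraphrase of the bytewise comparator's behavior as I understand it, not the code in HBASE-7845.txt, and it does not attempt to model the HBase KeyValue key layout. The function names and the example keys are hypothetical.

```python
def find_shortest_separator(start: bytes, limit: bytes) -> bytes:
    """Return a short key s with start <= s < limit (bytewise order).

    Falls back to `start` unchanged when no shorter key is found.
    """
    min_len = min(len(start), len(limit))
    i = 0
    # Skip the common prefix.
    while i < min_len and start[i] == limit[i]:
        i += 1
    if i >= min_len:
        # One key is a prefix of the other: do not shorten.
        return start
    b = start[i]
    # Bump the first differing byte only if the result stays below limit.
    if b < 0xFF and b + 1 < limit[i]:
        return start[:i] + bytes([b + 1])
    return start


def find_short_successor(key: bytes) -> bytes:
    """Return a short key s with s >= key (bytewise order)."""
    for i, b in enumerate(key):
        if b != 0xFF:
            # Increment the first byte that can be incremented, drop the rest.
            return key[:i] + bytes([b + 1])
    # Every byte is 0xFF: leave the key as it is.
    return key


# Hypothetical YCSB-style row keys: last key of one block, first of the next.
last_key = b"user1000385178204227360"
next_key = b"user1000778834225329212"
print(find_shortest_separator(last_key, next_key))
print(find_short_successor(last_key))
```

The index entry only has to sort between the last key of one block and the first key of the next, so a shortened separator can stand in for the full key. How much is saved depends on how long the shared prefix between adjacent keys is.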

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
