[ https://issues.apache.org/jira/browse/HBASE-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15482804#comment-15482804 ]
binlijin commented on HBASE-16594:
----------------------------------
I took part of column family a's data and tested it with ROW_INDEX_V2.
Second, the random-get QPS results are:
{code}
RegionServer Network out is about 1.8GB
8k  NONE         (CPU System/User 7/58) QPS=167k
8k  ROW_INDEX_V1 (CPU System/User 7/60) QPS=164k
8k  ROW_INDEX_V2 (CPU System/User 7/52) QPS=164k
16k NONE         (CPU System/User 7/59) QPS=166.5k
16k ROW_INDEX_V1 (CPU System/User 7/55) QPS=165.6k
16k ROW_INDEX_V2 (CPU System/User 7/54) QPS=165k
32k NONE         (CPU System/User 7/63) QPS=165k
32k ROW_INDEX_V1 (CPU System/User 7/56) QPS=166k
32k ROW_INDEX_V2 (CPU System/User 7/54) QPS=164k
64k NONE         (CPU System/User 7/65) QPS=160k
64k ROW_INDEX_V1 (CPU System/User 7/56) QPS=165k
64k ROW_INDEX_V2 (CPU System/User 7/53) QPS=165k
{code}
> ROW_INDEX_V2 DBE
> ----------------
>
> Key: HBASE-16594
> URL: https://issues.apache.org/jira/browse/HBASE-16594
> Project: HBase
> Issue Type: Sub-task
> Components: Performance
> Reporter: binlijin
> Fix For: 2.0.0, 1.4.0
>
> Attachments: HBASE-16594-master_v1.patch, HBASE-16594-master_v2.patch
>
>
> See HBASE-16213, ROW_INDEX_V1 DataBlockEncoding.
> ROW_INDEX_V1 is the first version, which has no storage optimization.
> ROW_INDEX_V2 adds storage optimization: it stores every row only once and
> stores the column family only once per HFileBlock.
> ROW_INDEX_V1 is:
> /**
> * Store cells following every row's start offset, so we can binary search to
> * a row's cells.
> *
> * Format:
> * flat cells
> * integer: number of rows
> * integer: row0's offset
> * integer: row1's offset
> * ....
> * integer: dataSize
> *
> */
> ROW_INDEX_V2 is:
> * row1 qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * row2 qualifier timestamp type value tag
> * row3 qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * ....
> * integer: number of rows
> * integer: row0's offset
> * integer: row1's offset
> * ....
> * column family
> * integer: dataSize
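
The trailer shared by both formats (number of rows, per-row offsets, then dataSize as the last integer) is what makes the binary search possible. The following is a minimal sketch of that idea for the ROW_INDEX_V1 layout only, not the code from the attached patches. The class name RowIndexSketch, its methods, and the simplified cell layout (short row length, row bytes, int value length, value bytes) are all assumptions made for illustration; real HBase cells also carry family, qualifier, timestamp, type and tags, and rows are compared as raw bytes rather than as Java strings.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration of the ROW_INDEX_V1 block layout; not HBase code.
class RowIndexSketch {

    /** Encodes (row, value) cells; the rows array must already be sorted. */
    static ByteBuffer encode(String[] rows, String[] values) {
        ByteBuffer buf = ByteBuffer.allocate(4096); // fixed size keeps the sketch short
        List<Integer> rowOffsets = new ArrayList<>();
        String previousRow = null;
        for (int i = 0; i < rows.length; i++) {
            if (!rows[i].equals(previousRow)) {
                rowOffsets.add(buf.position()); // start offset of a new row
                previousRow = rows[i];
            }
            byte[] row = rows[i].getBytes(StandardCharsets.UTF_8);
            byte[] value = values[i].getBytes(StandardCharsets.UTF_8);
            // Simplified (assumed) cell layout: rowLen, row, valueLen, value.
            buf.putShort((short) row.length).put(row);
            buf.putInt(value.length).put(value);
        }
        int dataSize = buf.position();   // end of the flat cells
        buf.putInt(rowOffsets.size());   // integer: number of rows
        for (int offset : rowOffsets) {
            buf.putInt(offset);          // integer: rowN's offset
        }
        buf.putInt(dataSize);            // integer: dataSize (last 4 bytes)
        buf.flip();
        return buf;
    }

    /** Reads the row key of the cell that starts at the given offset. */
    static String rowAt(ByteBuffer block, int offset) {
        int rowLength = block.getShort(offset);
        byte[] row = new byte[rowLength];
        for (int i = 0; i < rowLength; i++) {
            row[i] = block.get(offset + 2 + i);
        }
        return new String(row, StandardCharsets.UTF_8);
    }

    /** Binary searches the row index; returns the row's start offset or -1. */
    static int seekRow(ByteBuffer block, String row) {
        int dataSize = block.getInt(block.limit() - 4);
        int rowCount = block.getInt(dataSize);
        int low = 0;
        int high = rowCount - 1;
        while (low <= high) {
            int mid = (low + high) >>> 1;
            int offset = block.getInt(dataSize + 4 + 4 * mid);
            // String comparison stands in for HBase's byte-wise row comparison.
            int cmp = rowAt(block, offset).compareTo(row);
            if (cmp == 0) {
                return offset;
            }
            if (cmp < 0) {
                low = mid + 1;
            } else {
                high = mid - 1;
            }
        }
        return -1;
    }
}
```

With rows {"r1", "r1", "r2", "r3"}, each cell in this sketch takes 9 bytes, so seekRow finds "r2" at offset 18 after inspecting only the index entries, without scanning the cells of "r1". ROW_INDEX_V2 keeps the same index but, per the description above, writes each row key once per row and the column family once per block; that part is not modeled here.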