[
https://issues.apache.org/jira/browse/HBASE-16594?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
binlijin updated HBASE-16594:
-----------------------------
Description:
See HBASE-16213, ROW_INDEX_V1 DataBlockEncoding.
ROW_INDEX_V1 is the first version which have no storage optimization,
ROW_INDEX_V2 do storage optimization: store every row only once, store column
family only once in a HFileBlock.
ROW_INDEX_V1 is :
/**
* Store cells following every row's start offset, so we can binary search to a
row's cells.
*
* Format:
* flat cells
* integer: number of rows
* integer: row0's offset
* integer: row1's offset
* ....
* integer: dataSize
*
*/
ROW_INDEX_V2 is :
* row1 qualifier timestamp type value tag
* qualifier timestamp type value tag
* qualifier timestamp type value tag
* row2 qualifier timestamp type value tag
* row3 qualifier timestamp type value tag
* qualifier timestamp type value tag
* ....
* integer: number of rows
* integer: row0's offset
* integer: row1's offset
* ....
* column family
* integer: dataSize
was:
See HBASE-16213, ROW_INDEX_V1 DataBlockEncoding.
ROW_INDEX_V1 is the first version which have no storage optimization,
ROW_INDEX_V2 do storage optimization: store every row only once, store column
family only once in a HFileBlock.
> ROW_INDEX_V2 DBE
> ----------------
>
> Key: HBASE-16594
> URL: https://issues.apache.org/jira/browse/HBASE-16594
> Project: HBase
> Issue Type: Sub-task
> Components: Performance
> Reporter: binlijin
> Fix For: 2.0.0, 1.4.0
>
>
> See HBASE-16213, ROW_INDEX_V1 DataBlockEncoding.
> ROW_INDEX_V1 is the first version which have no storage optimization,
> ROW_INDEX_V2 do storage optimization: store every row only once, store column
> family only once in a HFileBlock.
> ROW_INDEX_V1 is :
> /**
> * Store cells following every row's start offset, so we can binary search to
> a row's cells.
> *
> * Format:
> * flat cells
> * integer: number of rows
> * integer: row0's offset
> * integer: row1's offset
> * ....
> * integer: dataSize
> *
> */
> ROW_INDEX_V2 is :
> * row1 qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * row2 qualifier timestamp type value tag
> * row3 qualifier timestamp type value tag
> * qualifier timestamp type value tag
> * ....
> * integer: number of rows
> * integer: row0's offset
> * integer: row1's offset
> * ....
> * column family
> * integer: dataSize
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)