[ 
https://issues.apache.org/jira/browse/HBASE-1902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12764810#action_12764810
 ] 

Schubert Zhang commented on HBASE-1902:
---------------------------------------

@Stack,

I double checked the BigTable paper:
"We wrote a single string under each row key. Each string was generated 
randomly and was therefore uncompressible. In addition, strings
under different row key were distinct, so no cross-row compression was possible."

So, to let the PerformanceEvaluation tool behave the same as in the BT paper, I am 
thinking it may be more flexible to implement two variants of generateValue(): 
a default method that generates random values as in the paper, and another that 
generates less random values.

Do you think the above is necessary? If so, I will do it right away.
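
A minimal sketch of the two variants, to make the idea concrete. The names 
generateRandomValue and generateCompressibleValue, the length and run 
parameters, and the class ValueSketch are hypothetical and are not taken from 
the attached patch. java.util.zip.Deflater is used only as a stand-in 
compressor, since LZO is not part of the JDK, so the sizes it reports say 
nothing about LZO ratios.

```java
import java.util.Random;
import java.util.zip.Deflater;

public class ValueSketch {

    /** Fully random bytes, as in the BigTable paper: effectively incompressible. */
    public static byte[] generateRandomValue(Random r, int length) {
        byte[] value = new byte[length];
        r.nextBytes(value);
        return value;
    }

    /**
     * Less random bytes (hypothetical variant): each random byte is repeated
     * up to `run` times, so a compressor can find redundancy.
     */
    public static byte[] generateCompressibleValue(Random r, int length, int run) {
        byte[] value = new byte[length];
        for (int i = 0; i < length; i += run) {
            byte b = (byte) r.nextInt(256);
            int end = Math.min(i + run, length);
            for (int j = i; j < end; j++) {
                value[j] = b;
            }
        }
        return value;
    }

    /** Compressed size in bytes using DEFLATE (a stand-in for LZO). */
    public static int deflatedSize(byte[] input) {
        Deflater deflater = new Deflater();
        deflater.setInput(input);
        deflater.finish();
        byte[] buffer = new byte[1024];
        int total = 0;
        while (!deflater.finished()) {
            total += deflater.deflate(buffer);
        }
        deflater.end();
        return total;
    }

    public static void main(String[] args) {
        Random r = new Random();
        byte[] random = generateRandomValue(r, 1000);
        byte[] lessRandom = generateCompressibleValue(r, 1000, 8);
        System.out.println("random value, deflated size:      " + deflatedSize(random));
        System.out.println("less random value, deflated size: " + deflatedSize(lessRandom));
    }
}
```

The random variant could stay the default so that results remain comparable 
with the paper, with the less random one selected only when a compression 
algorithm is being evaluated.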

> Let PerformanceEvaluation support setting tableName and compress algorithm
> --------------------------------------------------------------------------
>
>                 Key: HBASE-1902
>                 URL: https://issues.apache.org/jira/browse/HBASE-1902
>             Project: Hadoop HBase
>          Issue Type: Improvement
>          Components: test
>    Affects Versions: 0.20.1
>            Reporter: Schubert Zhang
>            Assignee: Schubert Zhang
>            Priority: Minor
>             Fix For: 0.21.0
>
>         Attachments: HBASE-1902-v1.patch
>
>
> 1. Currently, PerformanceEvaluation.java does not use compression by default and 
> always uses "TestTable" as the table name. Two command-line arguments are added to 
> let the user set the table name and compression algorithm, e.g.:
> (1) tableName="PE_LZO", compression="lzo"
> bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation --table=PE_LZO 
> --compress=lzo sequentialWrite 4
> (2) default (tableName="TestTable", compression="none")
> bin/hbase org.apache.hadoop.hbase.PerformanceEvaluation sequentialWrite 4
> 2. The current "public static byte[] generateValue(final Random r)" generates 
> a byte[] that is too random, so the effect of compression is not visible 
> (e.g. there is no size reduction with LZO).
> A new "public static byte[] generateValue(final Random r)" is provided here. 
> With it, we get a 4:1 compression ratio for LZO.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
