[
https://issues.apache.org/jira/browse/HBASE-4516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13117761#comment-13117761
]
[email protected] commented on HBASE-4516:
------------------------------------------------------
-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2122/
-----------------------------------------------------------
(Updated 2011-09-29 23:54:05.869273)
Review request for hbase, Michael Stack and Jonathan Gray.
Changes
-------
Updating summary (sorry for spam).
Summary (updated)
-------
This is a load testing tool for HFile implementations, which supports two
workloads:
Compactions (merge the input HFiles). A special case of this is only one
input, which allows to do HFile format conversions.
Random reads. Launches the specified number of threads that do seeks and
short scans on randomly generated keys.
The original purpose of this tool was to ensure that HFile format v2 did not
introduce performance regressions.
Keys for the read workload are generated randomly between the first and the
last key of the HFile. At each position, instead of precisely calculating the
correct probability for every byte value b, we select a uniformly random byte
between in the allowed [low, high] range. In addition, there is a heuristic
that determines the positions at which the key has hex characters, and the
random key contains hex characters at those positions as well.
Example output for the random read workload:
Time: 120 sec, seek/sec: 8290, kv/sec: 30351, kv bytes/sec: 91868121, blk/sec:
10147, unique keys: 232779
Also refactoring and clarifying the confusing situation when a StoreFile
happens to have a different Bloom filter type than what is configured for the
column family.
This addresses bug HBASE-4516.
https://issues.apache.org/jira/browse/HBASE-4516
Diffs
-----
src/main/java/org/apache/hadoop/hbase/io/hfile/HFileBlock.java b429819
src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java fd9e7ef
src/test/java/org/apache/hadoop/hbase/regionserver/HFileReadWriteTest.java
PRE-CREATION
Diff: https://reviews.apache.org/r/2122/diff
Testing
-------
Running unit tests and the load tester tool itself.
Thanks,
Mikhail
> HFile-level load tester with compaction and random-read workloads
> -----------------------------------------------------------------
>
> Key: HBASE-4516
> URL: https://issues.apache.org/jira/browse/HBASE-4516
> Project: HBase
> Issue Type: Test
> Reporter: Mikhail Bautin
> Priority: Minor
> Fix For: 0.94.0
>
>
> This is a load testing tool for HFile implementations, which supports two
> workloads:
> - Compactions (merge the input HFiles). A special case of this is only one
> input, which allows to do HFile format conversions.
> - Random reads. Launches the specified number of threads that do seeks and
> short scans on randomly generated keys.
> The original purpose of this tool was to ensure that HFile format v2 did not
> introduce performance regressions.
> Keys for the read workload are generated randomly between the first and the
> last key of the HFile. At each position, instead of precisely calculating the
> correct probability for every byte value b, we select a uniformly random byte
> between in the allowed [low, high] range. In addition, there is a heuristic
> that determines the positions at which the key has hex characters, and the
> random key contains hex characters at those positions as well.
> Example output for the random read workload:
> Time: 120 sec, seek/sec: 8290, kv/sec: 30351, kv bytes/sec: 91868121,
> blk/sec: 10147, unique keys: 232779
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira