HFile-level load tester with compaction and random-read workloads
-----------------------------------------------------------------

                 Key: HBASE-4516
                 URL: https://issues.apache.org/jira/browse/HBASE-4516
             Project: HBase
          Issue Type: Test
            Reporter: Mikhail Bautin
            Priority: Minor
             Fix For: 0.94.0


This is a load testing tool for HFile implementations, which supports two 
workloads:
- Compactions (merge the input HFiles). A special case of this is only one 
input, which allows to do HFile format conversions.
- Random reads. Launches the specified number of threads that do seeks and 
short scans on randomly generated keys.

The original purpose of this tool was to ensure that HFile format v2 did not 
introduce performance regressions.

Keys for the read workload are generated randomly between the first and the 
last key of the HFile. At each position, instead of precisely calculating the 
correct probability for every byte value b, we select a uniformly random byte 
between in the allowed [low, high] range. In addition, there is a heuristic 
that determines the positions at which the key has hex characters, and the 
random key contains hex characters at those positions as well.

Example output for the random read workload:
Time: 120 sec, seek/sec: 8290, kv/sec: 30351, kv bytes/sec: 91868121, blk/sec: 
10147, unique keys: 232779


--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to