[
https://issues.apache.org/jira/browse/HBASE-10552?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13902820#comment-13902820
]
Lars Hofhansl commented on HBASE-10552:
---------------------------------------
Trunk has a different fix for this:
{code}
if (scanner.seekTo(gaussianRandomRowBytes) < 0) {
  LOG.info("Not able to seekTo " + new String(gaussianRandomRowBytes));
  return;
}
{code}
That is a hack, IMHO. We get -1 because the generated seek key sorts before
the first key of the file.
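For illustration, here is a minimal sketch of the clamping alternative. It is not the attached patch or the trunk code. The names ClampedGaussianKey, nextClampedKey, clamp and maxKey are hypothetical, and the inclusive upper bound is an assumption (pass N - 1 instead of N if the file holds keys 0 through N - 1). It draws a key from a Gaussian with mean N/2 and sigma N/10, then forces it into the seeded range so that a seek never starts before the first key.

```java
import java.util.Random;

// Hypothetical helper, not part of HFilePerformanceEvaluation.
class ClampedGaussianKey {

    // Restrict value to the inclusive range [min, max].
    static int clamp(long value, int min, int max) {
        return (int) Math.max((long) min, Math.min((long) max, value));
    }

    // Draw a row key from a Gaussian with mean n/2 and sigma n/10,
    // then clamp it into [0, maxKey] so it never falls outside the
    // keys the file was seeded with (maxKey is an assumed parameter).
    static int nextClampedKey(Random random, int n, int maxKey) {
        double sample = n / 2.0 + random.nextGaussian() * (n / 10.0);
        return clamp(Math.round(sample), 0, maxKey);
    }
}
```

With this approach the benchmark would still seek on every iteration, whereas the trunk check logs and returns. Because sigma is N/10, 0 lies five standard deviations below the mean, so clamped samples are rare and should barely change the distribution.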
> HFilePerformanceEvaluation.GaussianRandomReadBenchmark fails sometimes.
> -----------------------------------------------------------------------
>
> Key: HBASE-10552
> URL: https://issues.apache.org/jira/browse/HBASE-10552
> Project: HBase
> Issue Type: Bug
> Reporter: Lars Hofhansl
> Priority: Minor
> Fix For: 0.96.2, 0.98.1, 0.99.0, 0.94.17
>
> Attachments: 10552-0.94.txt
>
>
> GaussianRandomReadBenchmark generates seek keys by using a Gaussian
> distribution with the mean of N/2 and a sigma of N/10 (N = number of rows
> used) and using this key directly to seek into the HFile. The HFile was
> seeded with keys from 0-N.
> This will fail if we ever generate a key < 0, which is rare but by no means
> impossible. We need to clamp the generated key to a minimum of 0 and a
> maximum of N.