[
https://issues.apache.org/jira/browse/HBASE-5786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13399053#comment-13399053
]
Andrew Wang commented on HBASE-5786:
------------------------------------
A real stats expert can weigh in, but I don't think the current sampling
methods are well-suited for computing high-percentile latencies. Reservoir
sampling is fine for computing gross statistics like the mean and stddev, but
you really want to be biasing your sampling toward the top end for accurate
95th and 99th percentile estimates.
I unfortunately don't have any solutions yet, but I'm looking into it.
> Implement histogram metrics for flush and compaction latencies and sizes.
> -------------------------------------------------------------------------
>
> Key: HBASE-5786
> URL: https://issues.apache.org/jira/browse/HBASE-5786
> Project: HBase
> Issue Type: New Feature
> Components: metrics, regionserver
> Affects Versions: 0.92.2, 0.94.0, 0.96.0
> Reporter: Jonathan Hsieh
>
> Average time for region operations doesn't really tell a useful story when
> that help diagnose anomalous conditions.
> It would be extremely useful to add histogramming metrics similar to
> HBASE-5533 for region operations like flush, compaction and splitting. The
> probably should be forward biased at a much coarser granularity however
> (maybe decay every day?)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira