[
https://issues.apache.org/jira/browse/HBASE-2888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893851#action_12893851
]
ryan rawson commented on HBASE-2888:
------------------------------------
here are some of the things i've identified as issues:
- HFile states, eg: fsReadLatency, is in milliseconds, and it should really be
in microseconds.
- we should generate 99th and 95th percentile for many of the stats (eg:
fsReadLatency) and publish it. Perhaps a 1 and/or 5 minute 99th rolling
percentile.
- The HFile metrics integration is a little weak, we use some volatiles and
scrape them, for the enhanced 99th/95th pc stats we'll need access to the
richer stats classes. HFile depends on Hadoop and hbase.util so with a little
moving of things around, hopefully it'll be possible to actually make better
stats w/o having HFile depends on HRS (for example)
> Review all our metrics
> ----------------------
>
> Key: HBASE-2888
> URL: https://issues.apache.org/jira/browse/HBASE-2888
> Project: HBase
> Issue Type: Improvement
> Components: master
> Reporter: Jean-Daniel Cryans
> Fix For: 0.90.0
>
>
> HBase publishes a bunch of metrics, some useful some wasteful, that should be
> improved to deliver a better ops experience. Examples:
> - Block cache hit ratio converges at some point and stops moving
> - fsReadLatency goes down when compactions are running
> - storefileIndexSizeMB is the exact same number once a system is serving
> production load
> We could use new metrics too.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.