Alexey Serbin created KUDU-3566:
-----------------------------------
Summary: Incorrect semantics of Prometheus-style histogram-type
metrics
Key: KUDU-3566
URL: https://issues.apache.org/jira/browse/KUDU-3566
Project: Kudu
Issue Type: Bug
Components: master, tserver
Affects Versions: 1.17.0
Reporter: Alexey Serbin
Original KUDU-3375 implementation incorrectly exposes [summary-type Prometheus
metrics|https://prometheus.io/docs/concepts/metric_types/#summary] as
[histogram-type
ones|https://prometheus.io/docs/concepts/metric_types/#histogram] for data
collected by corresponding HDR histograms. For example, below are snippets
from {{/metric}} and {{/metrics_prometheus}} for statistics on ListMasters RPC.
JSON-style:
{noformat}
{
"name": "handler_latency_kudu_master_MasterService_ListMasters",
"total_count": 26,
"min": 152,
"mean": 301.2692307692308,
"percentile_75": 324,
"percentile_95": 468,
"percentile_99": 844,
"percentile_99_9": 844,
"percentile_99_99": 844,
"max": 844,
"total_sum": 7833
}
{noformat}
Prometheus-style counterpart:
{noformat}
# HELP kudu_master_handler_latency_kudu_master_MasterService_ListMasters
Microseconds spent handling kudu.master.MasterService.ListMasters RPC requests
# TYPE kudu_master_handler_latency_kudu_master_MasterService_ListMasters
histogram
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="0.75"} 324
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="0.95"} 468
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="0.99"} 844
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="0.999"} 844
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="0.9999"} 844
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_bucket{unit_type="microseconds",
le="+Inf"} 26
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_sum{unit_type="microseconds"}
7833
kudu_master_handler_latency_kudu_master_MasterService_ListMasters_count{unit_type="microseconds"}
26
{noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)