[
https://issues.apache.org/jira/browse/KUDU-3514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Alexey Serbin updated KUDU-3514:
--------------------------------
Code Review: http://gerrit.cloudera.org:8080/20517
> Introduce per-RPC metric of responses sent past the client-defined deadline
> ---------------------------------------------------------------------------
>
> Key: KUDU-3514
> URL: https://issues.apache.org/jira/browse/KUDU-3514
> Project: Kudu
> Issue Type: Improvement
> Components: master, server, tserver
> Reporter: Alexey Serbin
> Assignee: Alexey Serbin
> Priority: Major
> Labels: metrics, supportability
>
> As of Kudu 1.17.0, there is a per-RPC histogram for processing times of
> remote calls: {{handler_latency_xxx}}, where the {{xxx}} suffix stands for
> the RPC interface and method name. There is also a service-wide
> {{rpc_incoming_queue_time}} histogram.
> However, even with these two metrics it's hard to tell definitively how
> many calls of a particular RPC method timed out while being processed by a
> Kudu server, even if the client-defined timeout is known, fixed, and the same
> for all the RPCs processed by a Kudu server.
> It would be nice to introduce a new per-RPC counter to accumulate stats on
> the responses that have been sent back to the client side after the
> client-defined deadline has already passed. That would help in
> troubleshooting particular scenarios and provide a glimpse of what fraction
> of requests to a particular RPC method timed out while being processed.
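
The counter proposed above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not Kudu's actual metric API or the change under review: the {{LateResponseCounter}} class, its method names, and the extra total-responses count are all hypothetical, and the sketch assumes the server knows the client-defined deadline of each call when it sends the response.

```cpp
#include <atomic>
#include <cassert>
#include <chrono>
#include <cstdint>

using Clock = std::chrono::steady_clock;

// Hypothetical per-RPC-method counter (one instance per RPC method).
// It is NOT part of Kudu's metrics library; names are for illustration only.
class LateResponseCounter {
 public:
  // Call right before a response is sent back to the client.
  // 'deadline' is the client-defined deadline for the call (assumed to be
  // available on the server side); 'now' is injectable for testing.
  // Returns true if the response was counted as sent past the deadline.
  bool RecordResponse(Clock::time_point deadline,
                      Clock::time_point now = Clock::now()) {
    total_.fetch_add(1, std::memory_order_relaxed);
    if (now > deadline) {
      late_.fetch_add(1, std::memory_order_relaxed);
      return true;
    }
    return false;
  }

  // Number of responses sent after the client-defined deadline.
  uint64_t late() const { return late_.load(std::memory_order_relaxed); }

  // Number of all responses recorded (added here only to compute a ratio).
  uint64_t total() const { return total_.load(std::memory_order_relaxed); }

  // Fraction of responses that were sent past the deadline.
  double LateFraction() const {
    const uint64_t t = total();
    return t == 0 ? 0.0 : static_cast<double>(late()) / static_cast<double>(t);
  }

 private:
  std::atomic<uint64_t> total_{0};
  std::atomic<uint64_t> late_{0};
};
```

In a real server the total would more likely be taken from the sample count of the existing {{handler_latency_xxx}} histogram for the same method, so only the late-response count would need a new metric.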