[ 
https://issues.apache.org/jira/browse/FLINK-23486?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17422150#comment-17422150
 ] 

Roman Khachatryan commented on FLINK-23486:
-------------------------------------------

Hi [~complone], I've extracted back-pressure monitoring into FLINK-24402, as 
its implementation is independent of the other metrics but, on the other hand, 
it does depend on back-pressure itself. I think it can be reported as time 
back-pressured by the backend per second, similar to how "regular" 
back-pressure is currently reported 
([prototype|https://github.com/rkhachatryan/flink/tree/clsb-bp-test]).
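
As a rough illustration of the "per second" reporting idea, here is a minimal sketch of a gauge that accumulates the time a task spends blocked by the backend and reports it as milliseconds per second. The class and method names (BackendBackPressureGauge, markBlocked, markUnblocked, reportMsPerSecond) are hypothetical and are not taken from Flink or from the linked prototype. Timestamps are passed in explicitly to keep the sketch deterministic, and it is not thread-safe.

```java
/** Hypothetical helper, not a Flink class: tracks how long a task was blocked by the backend. */
final class BackendBackPressureGauge {
    private long windowStartMs;          // start of the current reporting window
    private long blockedSinceMs = -1;    // -1 means "not currently blocked"
    private long blockedInWindowMs;      // blocked time accumulated in this window

    BackendBackPressureGauge(long nowMs) {
        this.windowStartMs = nowMs;
    }

    /** Call when the task starts waiting for the backend (assumed trigger: e.g. too many in-flight uploads). */
    void markBlocked(long nowMs) {
        if (blockedSinceMs < 0) {
            blockedSinceMs = nowMs;
        }
    }

    /** Call when the task resumes processing. */
    void markUnblocked(long nowMs) {
        if (blockedSinceMs >= 0) {
            blockedInWindowMs += nowMs - blockedSinceMs;
            blockedSinceMs = -1;
        }
    }

    /** Returns blocked milliseconds per second of wall time since the last report, then starts a new window. */
    long reportMsPerSecond(long nowMs) {
        long blocked = blockedInWindowMs;
        if (blockedSinceMs >= 0) {
            // Still blocked: count the time up to now and carry the remainder into the next window.
            blocked += nowMs - blockedSinceMs;
            blockedSinceMs = nowMs;
        }
        long windowMs = Math.max(1, nowMs - windowStartMs); // avoid division by zero
        windowStartMs = nowMs;
        blockedInWindowMs = 0;
        return blocked * 1000 / windowMs;
    }
}
```

With a one-second reporting interval, a task that was blocked for 250 ms of that second would report 250, which reads the same way as a milliseconds-per-second back-pressure value.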

Besides that, I think we also need:
 # low-level (uploader) metrics, such as request rate, errors, saturation, and 
latency
 # feature-specific metrics, such as for cross-operator grouping and retries
 # backend-level latency metrics (similar to existing latency tracking)

I've opened a PR that adds (1) and (2).

(3) is independent as well and seems less critical.
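
To make the low-level uploader metrics from item (1) more concrete, the following is a minimal sketch of a holder that tracks request count, errors, saturation, and average latency. Everything here is an assumption for illustration: the class UploaderMetricsSketch, its method names, and the definition of saturation as in-flight requests divided by a configured maximum are hypothetical and do not come from the PR or from Flink's metrics API.

```java
import java.util.concurrent.atomic.AtomicLong;

/** Hypothetical holder for uploader metrics; names are illustrative, not Flink's. */
final class UploaderMetricsSketch {
    private final AtomicLong started = new AtomicLong();        // requests issued
    private final AtomicLong completed = new AtomicLong();      // requests finished (success or failure)
    private final AtomicLong failures = new AtomicLong();       // requests that ended with an error
    private final AtomicLong totalLatencyMs = new AtomicLong(); // sum of latencies of finished requests
    private final int maxInFlight;                              // assumed upper bound on concurrent requests

    UploaderMetricsSketch(int maxInFlight) {
        this.maxInFlight = maxInFlight;
    }

    /** Record that an upload request was sent. */
    void onRequestStart() {
        started.incrementAndGet();
    }

    /** Record that an upload request finished, with its latency and outcome. */
    void onRequestEnd(long latencyMs, boolean success) {
        completed.incrementAndGet();
        totalLatencyMs.addAndGet(latencyMs);
        if (!success) {
            failures.incrementAndGet();
        }
    }

    long requestCount() {
        return started.get();
    }

    long failureCount() {
        return failures.get();
    }

    /** Fraction of the allowed in-flight capacity that is currently used. */
    double saturation() {
        return (double) (started.get() - completed.get()) / maxInFlight;
    }

    /** Mean latency of finished requests, or 0.0 if none have finished yet. */
    double averageLatencyMs() {
        long done = completed.get();
        return done == 0 ? 0.0 : (double) totalLatencyMs.get() / done;
    }
}
```

A request rate can be derived by sampling requestCount() at a fixed interval and taking the difference between samples.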
 

> Add monitoring/metrics for the ChangelogStateBackend
> ----------------------------------------------------
>
>                 Key: FLINK-23486
>                 URL: https://issues.apache.org/jira/browse/FLINK-23486
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Benchmarks
>            Reporter: Roman Khachatryan
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 1.15.0
>
>
> E.g. in-flight requests, request size, latency, number of “Logs” per request, 
> errors.
>  
> With back-pressure (FLINK-23381) it's very important because the task will be 
> shown as busy in the UI.



