Hi all,

MengYue and I created FLIP-274[1]: Introduce metric group for OperatorCoordinator. An OperatorCoordinator is the coordinator for a runtime operator and runs on the JobManager. It coordinates by exchanging operator events with all of its operators. This mechanism is used more and more in Flink: for example, many Sources and Sinks depend on it to assign splits and to coordinate commits to external systems. The OperatorCoordinator is widely used in the Flink Kafka connector, Flink Pulsar connector, Flink CDC connector, Flink Hudi connector, and so on.
However, there is currently no suitable metric group scope for the OperatorCoordinator, and no implementation of the OperatorCoordinatorMetricGroup interface. Metrics in an OperatorCoordinator could include how many splits/partitions have been assigned to source readers, or how many files have been written out by sink writers. Such metrics not only help users follow job progress but also make large jobs easier to maintain.

Therefore, we propose FLIP-274 to introduce a new metric group scope for OperatorCoordinator and to provide an internal implementation of OperatorCoordinatorMetricGroup.

Could you help review this FLIP when you get time? Any feedback is appreciated!

Best,
Hang

[1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-274%3A+Introduce+metric+group+for+OperatorCoordinator