[
https://issues.apache.org/jira/browse/FLINK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16977303#comment-16977303
]
lining commented on FLINK-14815:
--------------------------------
For the pool usages, in the case of data skew, although the average is very
low, the status of a task is not good. On the vertex, the pool usage could help
users to locate the bottleneck point based on these values when there is no
backpressure. If it is only displayed in the subtask, the user needs to view
subtasks of each vertex.
> Expose network pool usage in IOMetricsInfo
> ------------------------------------------
>
> Key: FLINK-14815
> URL: https://issues.apache.org/jira/browse/FLINK-14815
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Metrics, Runtime / Network, Runtime / REST
> Reporter: lining
> Assignee: lining
> Priority: Major
>
> * If sub task is not back pressured, but it is causing a back pressure (full
> input, empty output)
> * By comparing exclusive/floating buffers usage, whether all channels are
> back-pressured or only some of them
> {code:java}
> public final class IOMetricsInfo {
> private final float outPoolUsage;
> private final float inputExclusiveBuffersUsage;
> private final float inputFloatingBuffersUsage;
> }
> {code}
> JobDetailsInfo.JobVertexDetailsInfo merge use Math.max.(ps: outPoolUsage is
> from upstream)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)