[
https://issues.apache.org/jira/browse/FLINK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16977303#comment-16977303
]
lining edited comment on FLINK-14815 at 11/19/19 9:39 AM:
----------------------------------------------------------
For the pool usages, in the case of data skew, the average can be very low
even though an individual task is in a bad state. At the vertex level, the pool
usage values could help users locate the bottleneck when there is no
backpressure. If they are displayed only per subtask, the user has to inspect
the subtasks of every vertex. As you said, if a single task is backpressured,
the vertex is backpressured, so the max is probably more important than the
other two.
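
To illustrate the data-skew point above, here is a minimal sketch with made-up numbers. The class name, the helper methods, and the usage values are hypothetical and are not part of Flink; the sketch only shows how the average over subtasks can stay low while the max exposes the one saturated subtask.

```java
// Hypothetical illustration, not Flink code: per-subtask pool usage under data skew.
final class PoolUsageSkewExample {

    // Arithmetic mean of the per-subtask usages (0 for an empty array).
    static float average(float[] usages) {
        float sum = 0f;
        for (float u : usages) {
            sum += u;
        }
        return usages.length == 0 ? 0f : sum / usages.length;
    }

    // Largest per-subtask usage (0 for an empty array).
    static float max(float[] usages) {
        float result = 0f;
        for (float u : usages) {
            result = Math.max(result, u);
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed values: three nearly idle subtasks and one with a full pool.
        float[] usages = {0.05f, 0.05f, 0.05f, 1.0f};
        System.out.println("avg=" + average(usages) + " max=" + max(usages));
    }
}
```

With these assumed values the average stays below 0.3 while the max is 1.0, which is the case where a vertex-level max points at the bottleneck and an average does not.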
was (Author: lining):
For the pool usages, in the case of data skew, the average can be very low
even though an individual task is in a bad state. At the vertex level, the pool
usage values could help users locate the bottleneck when there is no
backpressure. If they are displayed only per subtask, the user has to inspect
the subtasks of every vertex.
> Expose network pool usage in IOMetricsInfo
> ------------------------------------------
>
> Key: FLINK-14815
> URL: https://issues.apache.org/jira/browse/FLINK-14815
> Project: Flink
> Issue Type: Sub-task
> Components: Runtime / Metrics, Runtime / Network, Runtime / REST
> Reporter: lining
> Assignee: lining
> Priority: Major
>
> * Whether a subtask is not back-pressured itself but is causing back
> pressure (full input, empty output)
> * By comparing exclusive/floating buffer usage, whether all channels are
> back-pressured or only some of them
> {code:java}
> public final class IOMetricsInfo {
>     // Proposed new fields; existing fields, constructor, and getters omitted.
>     private final float outPoolUsage;
>     private final float inputExclusiveBuffersUsage;
>     private final float inputFloatingBuffersUsage;
> }
> {code}
> JobDetailsInfo.JobVertexDetailsInfo merges the subtask values using
> Math.max. (PS: outPoolUsage is from upstream.)
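
The Math.max merge described above could look roughly like the following sketch. The class `PoolUsageSketch` and its `merge` method are hypothetical stand-ins, not the actual `IOMetricsInfo` or `JobDetailsInfo.JobVertexDetailsInfo` code; only the three field names are taken from the issue description.

```java
// Hypothetical sketch, not Flink code: merge per-subtask usages into a vertex value.
final class PoolUsageSketch {
    final float outPoolUsage;
    final float inputExclusiveBuffersUsage;
    final float inputFloatingBuffersUsage;

    PoolUsageSketch(
            float outPoolUsage,
            float inputExclusiveBuffersUsage,
            float inputFloatingBuffersUsage) {
        this.outPoolUsage = outPoolUsage;
        this.inputExclusiveBuffersUsage = inputExclusiveBuffersUsage;
        this.inputFloatingBuffersUsage = inputFloatingBuffersUsage;
    }

    // Field-wise maximum, so one saturated subtask is visible at the vertex level.
    PoolUsageSketch merge(PoolUsageSketch other) {
        return new PoolUsageSketch(
                Math.max(outPoolUsage, other.outPoolUsage),
                Math.max(inputExclusiveBuffersUsage, other.inputExclusiveBuffersUsage),
                Math.max(inputFloatingBuffersUsage, other.inputFloatingBuffersUsage));
    }

    public static void main(String[] args) {
        // Assumed values for two subtasks of the same vertex.
        PoolUsageSketch a = new PoolUsageSketch(0.1f, 1.0f, 0.0f);
        PoolUsageSketch b = new PoolUsageSketch(0.5f, 0.2f, 0.75f);
        PoolUsageSketch vertex = a.merge(b);
        System.out.println(vertex.outPoolUsage + " "
                + vertex.inputExclusiveBuffersUsage + " "
                + vertex.inputFloatingBuffersUsage);
    }
}
```

Folding all subtasks of a vertex through `merge` yields the per-field maximum, which matches the reasoning in the comment that a single back-pressured task makes the whole vertex back-pressured.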