[ 
https://issues.apache.org/jira/browse/FLINK-14815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16977303#comment-16977303
 ] 

lining commented on FLINK-14815:
--------------------------------

For pool usage under data skew, the average across subtasks can be very low 
even though one task is in bad shape. Showing pool usage on the vertex helps 
users locate the bottleneck from these values even when there is no 
backpressure. If it is displayed only per subtask, the user has to inspect the 
subtasks of every vertex.
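
To illustrate the skew case, here is a minimal sketch that compares the average 
and the maximum of per-subtask pool usage. The class name, method names and 
sample values are hypothetical and are not part of Flink; the only thing taken 
from the issue is the idea of merging subtask values with Math.max.

```java
import java.util.List;

// Hypothetical illustration, not Flink code: compares two ways of
// aggregating per-subtask pool usage into a single vertex-level value.
final class PoolUsageSkewExample {

    /** Average usage across subtasks; a single saturated subtask gets diluted. */
    static float average(List<Float> usages) {
        if (usages.isEmpty()) {
            return 0f;
        }
        float sum = 0f;
        for (float usage : usages) {
            sum += usage;
        }
        return sum / usages.size();
    }

    /** Maximum usage across subtasks; a single saturated subtask stays visible. */
    static float max(List<Float> usages) {
        float result = 0f;
        for (float usage : usages) {
            result = Math.max(result, usage);
        }
        return result;
    }

    public static void main(String[] args) {
        // Assumed sample data: four subtasks, one of which has a full input pool.
        List<Float> inputPoolUsage = List.of(0.0f, 0.0f, 1.0f, 0.0f);
        System.out.println("average = " + average(inputPoolUsage));
        System.out.println("max     = " + max(inputPoolUsage));
    }
}
```

With data like this, the average stays low while the maximum shows that one 
subtask is saturated, which is why a max-based value on the vertex is more 
useful for spotting the bottleneck.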

> Expose network pool usage in IOMetricsInfo
> ------------------------------------------
>
>                 Key: FLINK-14815
>                 URL: https://issues.apache.org/jira/browse/FLINK-14815
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Runtime / Metrics, Runtime / Network, Runtime / REST
>            Reporter: lining
>            Assignee: lining
>            Priority: Major
>
> * Whether a subtask is not back-pressured itself but is causing back pressure 
> (full input, empty output)
>  * By comparing exclusive/floating buffer usage, whether all channels are 
> back-pressured or only some of them
> {code:java}
> public final class IOMetricsInfo {
>     private final float outPoolUsage;
>     private final float inputExclusiveBuffersUsage;
>     private final float inputFloatingBuffersUsage;
> }
> {code}
> JobDetailsInfo.JobVertexDetailsInfo merges the subtask values using Math.max. 
> (PS: outPoolUsage comes from upstream.)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
