[ https://issues.apache.org/jira/browse/FLINK-19964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17227153#comment-17227153 ]
Zhu Zhu commented on FLINK-19964:
---------------------------------
We recently noticed FLINK-19994: pipelined region scheduling eagerly schedules
all the vertices in a DataSet iteration job.
[~roman_khachatryan] Is it possible that the problem is caused by a downstream
task allocating all available network buffers from the global pool, so that the
upstream task cannot obtain any buffer and gets stuck? If so, I think
FLINK-19994 can fix this problem.
However, I could not reproduce the problem in 1700+ local runs, so I'm not
sure whether my guess is correct.
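
To make the guess above concrete, here is a minimal toy sketch of the suspected
starvation. It is not Flink code: GlobalPool, tryRequest and upstreamGetsBuffer
are hypothetical names, and the pool is modelled as a plain semaphore rather
than the real network buffer pool. It only shows that if an eagerly scheduled
downstream task takes every buffer first, an upstream request cannot succeed.

```java
import java.util.concurrent.Semaphore;

public class BufferStarvationSketch {

    /** Toy stand-in for a global buffer pool (assumption, not Flink's class). */
    static final class GlobalPool {
        private final Semaphore buffers;

        GlobalPool(int totalBuffers) {
            this.buffers = new Semaphore(totalBuffers);
        }

        /** Non-blocking request: true only if all n buffers were available. */
        boolean tryRequest(int n) {
            return buffers.tryAcquire(n);
        }
    }

    /**
     * Downstream requests its buffers first (as with eager scheduling),
     * then upstream asks for a single buffer. Returns whether upstream got it.
     */
    static boolean upstreamGetsBuffer(int totalBuffers, int downstreamDemand) {
        GlobalPool pool = new GlobalPool(totalBuffers);
        pool.tryRequest(downstreamDemand); // downstream holds these and never recycles
        return pool.tryRequest(1);         // upstream needs one buffer to emit data
    }

    public static void main(String[] args) {
        System.out.println("downstream takes all 8: upstream got buffer = "
                + upstreamGetsBuffer(8, 8));
        System.out.println("downstream takes 7 of 8: upstream got buffer = "
                + upstreamGetsBuffer(8, 7));
    }
}
```

In the sketch the upstream request simply fails; a blocking request in the same
situation would wait forever, since downstream only recycles buffers after
receiving data that upstream can never send.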
> Gelly ITCase stuck on Azure in HITSITCase.testPrintWithRMatGraph
> ----------------------------------------------------------------
>
> Key: FLINK-19964
> URL: https://issues.apache.org/jira/browse/FLINK-19964
> Project: Flink
> Issue Type: Bug
> Components: Library / Graph Processing (Gelly), Runtime / Network,
> Tests
> Affects Versions: 1.12.0
> Reporter: Chesnay Schepler
> Assignee: Roman Khachatryan
> Priority: Blocker
> Labels: test-stability
> Fix For: 1.12.0
>
>
> The HITSITCase has gotten stuck on Azure. Chances are that something in the
> scheduling or network has broken it.
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=8919&view=logs&j=c5f0071e-1851-543e-9a45-9ac140befc32&t=1fb1a56f-e8b5-5a82-00a0-a2db7757b4f5