[
https://issues.apache.org/jira/browse/BEAM-7670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16877293#comment-16877293
]
Kyle Weaver commented on BEAM-7670:
-----------------------------------
>From the logs, it looks like the stuck worker's data client is not connecting
>properly. For example, with parallelism=4:
2019-07-02 11:20:33,105 INFO
org.apache.beam.runners.fnexecution.control.FnApiControlClientPoolService -
Beam Fn Control client connected with id 1
2019-07-02 11:20:33,126 INFO
org.apache.beam.runners.fnexecution.control.FnApiControlClientPoolService -
Beam Fn Control client connected with id 1
2019-07-02 11:20:33,131 INFO
org.apache.beam.runners.fnexecution.data.GrpcDataService - Beam Fn Data client
connected.
2019-07-02 11:20:33,184 INFO
org.apache.beam.runners.fnexecution.control.FnApiControlClientPoolService -
Beam Fn Control client connected with id 1
2019-07-02 11:20:33,199 INFO
org.apache.beam.runners.fnexecution.data.GrpcDataService - Beam Fn Data client
connected.
2019-07-02 11:20:33,204 INFO
org.apache.beam.runners.fnexecution.control.FnApiControlClientPoolService -
Beam Fn Control client connected with id 1
2019-07-02 11:20:33,222 INFO
org.apache.beam.runners.fnexecution.data.GrpcDataService - Beam Fn Data client
connected.
> Flink portable worker gets stuck if one of the task does not get any data
> -------------------------------------------------------------------------
>
> Key: BEAM-7670
> URL: https://issues.apache.org/jira/browse/BEAM-7670
> Project: Beam
> Issue Type: Bug
> Components: runner-flink
> Reporter: Ankur Goenka
> Priority: Major
>
> When using parallelism > 1 with flink portable runner, the job gets stuck if
> the data is partitioned in such a way that one of the task does not get any
> data.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)