GitHub user zhangminglei opened a pull request:
https://github.com/apache/flink/pull/6103
[FLINK-9413] [distributed coordination] Tasks can fail with Partitionâ¦
â¦NotFoundException if consumer deployment takes too long
## What is the purpose of the change
Tasks can fail with PartitionNotFoundException if consumer deployment takes
too long. And the producer has been assigned a slot but we do not wait until it
is actually running.
## Brief change log
Change the condition to make the producer wait until it is actually running.
## Verifying this change
This change is a trivial rework / code cleanup without any test coverage.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/zhangminglei/flink flink-9413
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/flink/pull/6103.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #6103
----
commit a52434d14117fde4e911f9a8f81a2e10fdd9ba77
Author: zhangminglei <zml13856086071@...>
Date: 2018-05-30T12:17:17Z
[FLINK-9413] [distributed coordination] Tasks can fail with
PartitionNotFoundException if consumer deployment takes too long
----
---