Hello All,
Sorry to be replying to an existing thread for my question. Actually we
are also facing the issue of the Flink Kafka source stopping consuming
messages completely.
It only started consuming messages after we re-submitted the Job. But
this happened only once and now it is not getting reproduced at all.
We are not using any watermarking strategy in specific.
I have a few questions:
1. What is the default watermarking strategy used in Flink. Can I
quickly check the default parameters being used by calling some function
or so ?
2. Are there any other conditions in which a source can be marked as
Idle apart from the watermarking issue mentioned below ?
3. If a Flink source is marked as Idle, is there any way to make it
active without having to re-submit the Flink Job ?
Or Is it that the source automatically becomes active after a
certain duration ?
Many thanks in advance,
Anirban
On 16-06-2023 02:27, Ken Krugler wrote:
I think you’re hitting this issue:
https://issues.apache.org/jira/browse/FLINK-31632
Fixed in 1.16.2, 1.171.
— Ken
On Jun 15, 2023, at 1:39 PM, Piotr Domagalski <pi...@domagalski.com>
wrote:
Hi all!
We've been experimenting with watermark alignment in Flink 1.15 and
observed an odd behaviour that I couldn't find any mention of in the
documentation.
With the following strategy:
WatermarkStrategy.<Event>forBoundedOutOfOrderness(Duration.ofSeconds(60))
.withTimestampAssigner((e, t) -> e.timestamp)
.withIdleness(Duration.ofSeconds(3600))
.withWatermarkAlignment("group-1", Duration.ofSeconds(15));
Kafka sources stop consuming completely after 3600s (even when the
data is flowing into all the partitions). Is this an expected
behaviour? Where could I find more information on this?
--
Piotr Domagalski
--------------------------
Ken Krugler
http://www.scaleunlimited.com
Custom big data solutions
Flink, Pinot, Solr, Elasticsearch