[
https://issues.apache.org/jira/browse/BEAM-9733?focusedWorklogId=420215&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-420215
]
ASF GitHub Bot logged work on BEAM-9733:
----------------------------------------
Author: ASF GitHub Bot
Created on: 10/Apr/20 13:34
Start Date: 10/Apr/20 13:34
Worklog Time Spent: 10m
Work Description: mxm commented on issue #11362: [BEAM-9733] Always let
ImpulseSourceFunction emit a final watermark
URL: https://github.com/apache/beam/pull/11362#issuecomment-612030833
I need to rewrite the FlinkSavepointTest since it assumed different
semantics. I think we can use the recently introduced timer output timestamp
feature. However, I realized the portable operator needs a slight adjustment to
fully support holding back the output timestamp correctly at all times. This is
trickier than in the non-portable operator with respect to timers setting new
timers; we do not have a direct feedback loop as we have in the non-portable
operator. We need an additional check when a timer sets a new timer with a
timer output timestamp because we only get to fire that timer after we started
a new bundle. Thus, we can't always advance the watermark after we finish
bundle execution.
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 420215)
Time Spent: 1h 10m (was: 1h)
> ImpulseSourceFunction does not emit a final watermark
> -----------------------------------------------------
>
> Key: BEAM-9733
> URL: https://issues.apache.org/jira/browse/BEAM-9733
> Project: Beam
> Issue Type: Bug
> Components: runner-flink
> Reporter: Maximilian Michels
> Assignee: Maximilian Michels
> Priority: Critical
> Fix For: 2.21.0
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> The Flink Runner's {{ImpulseSourceFunction}} does not emit a final watermark,
> unless {{--shutdownSourcesOnFinalWatermark}} flag has been specified (the
> flag is used in tests to shutdown the pipeline after reading all data). Most
> pipelines will be long-running and thus do not specify the flag.
> Not sending out the final watermark causes GroupByKey to hold back the data
> of event time windows until the pipeline is shut down (the final watermark is
> always emitted on pipeline shutdown which is why using the above flag works).
--
This message was sent by Atlassian Jira
(v8.3.4#803005)