[ 
https://issues.apache.org/jira/browse/BEAM-1372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17546263#comment-17546263
 ] 

Kenneth Knowles commented on BEAM-1372:
---------------------------------------

This issue has been migrated to https://github.com/apache/beam/issues/18145

> OutputTimeFn and Accumulating Mode is Confusing
> -----------------------------------------------
>
>                 Key: BEAM-1372
>                 URL: https://issues.apache.org/jira/browse/BEAM-1372
>             Project: Beam
>          Issue Type: Bug
>          Components: beam-model
>            Reporter: Thomas Groh
>            Priority: P3
>
> See [here|https://github.com/tgroh/beam/commit/2238df334a368ce1a41e14ee616be954c5430c73]
> for an example pipeline.
> The timestamp used by a pane does not change based on the accumulation mode 
> of the windowing strategy. As a result, elements that have associated 
> timestamps cannot be safely reassigned to those timestamps after a 
> GroupByKey if more than one pane could have been produced, regardless of the 
> {{OutputTimeFn}}. The first example pipeline demonstrates two PCollections 
> where the elements within the last PCollection cannot be reassigned to their 
> timestamps, even though we are using 
> {{OutputTimeFn#outputAtEarliestInputTimestamp}} and 
> When using a more complex windowing strategy like sessions, this is even more 
> confusing. A session that spans more than one of the downstream windows, but 
> that is produced in multiple panes, will over time be assigned to later and 
> later windows as more panes are produced. Thus, a pipeline that produces 
> session windows and wishes to group the sessions by the point at which they 
> started must only ever produce a single pane per session.
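
The behavior described above can be sketched with a small simulation. This is
plain Python, not Beam code, and it rests on one assumption read out of the
description: each pane takes its timestamp from the earliest element that
arrived since the previous firing, while in accumulating mode the pane's
contents include every element seen so far. The names Pane,
simulate_accumulating_panes, fixed_window_start and elements_behind_pane are
hypothetical helpers made up for this illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Pane:
    timestamp: int        # output timestamp assigned to the pane
    elements: List[int]   # timestamps of the elements contained in the pane


def simulate_accumulating_panes(firings: List[List[int]]) -> List[Pane]:
    """Simulate the panes of a single window in accumulating mode.

    Each entry of `firings` holds the timestamps of the elements that arrived
    since the previous trigger firing.

    ASSUMPTION (not taken from Beam source): the pane timestamp is the
    earliest timestamp among the newly arrived elements only, mimicking an
    "earliest input timestamp" policy whose hold is released at each firing.
    """
    seen: List[int] = []
    panes: List[Pane] = []
    for batch in firings:
        if not batch:
            continue  # nothing new arrived, so no pane in this sketch
        seen.extend(batch)
        panes.append(Pane(timestamp=min(batch), elements=sorted(seen)))
    return panes


def fixed_window_start(timestamp: int, size: int) -> int:
    """Start of the fixed window of width `size` that contains `timestamp`."""
    return timestamp - (timestamp % size)


def elements_behind_pane(pane: Pane) -> List[int]:
    """Elements whose own timestamp is earlier than the pane's timestamp."""
    return [ts for ts in pane.elements if ts < pane.timestamp]


# One session whose elements arrive across three firings.
panes = simulate_accumulating_panes([[1], [12], [25]])
for pane in panes:
    print(
        pane.timestamp,
        pane.elements,
        fixed_window_start(pane.timestamp, 10),
        elements_behind_pane(pane),
    )
# pane timestamps:            1, 12, 25
# downstream window starts:   0, 10, 20
# elements behind last pane:  [1, 12]
```

Under that assumption, the same session lands in downstream windows starting
at 0, then 10, then 20 as panes accumulate, and the later panes carry elements
(1 and 12) that are older than the pane's own timestamp, which is why moving
them back to their original timestamps is not safe.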



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
