[ https://issues.apache.org/jira/browse/BEAM-9888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Kenneth Knowles updated BEAM-9888: ---------------------------------- Status: Open (was: Triage Needed) > @RequiresTimeSortedInput might feed data out of order > ----------------------------------------------------- > > Key: BEAM-9888 > URL: https://issues.apache.org/jira/browse/BEAM-9888 > Project: Beam > Issue Type: Improvement > Components: runner-core > Affects Versions: 2.20.0 > Reporter: Jan Lukavský > Assignee: Jan Lukavský > Priority: Major > Fix For: 2.21.0 > > > StatefulDoFnRunner drops late data based on output watermark, but output > watermark might not advance fast enough to discard data that should have been > dropped. We have to either: > a) drop data based on input watermark, or > b) drop data based on timestamp of last emitted element > I'm a little in favor of b), because it can drop less data. -- This message was sent by Atlassian Jira (v8.3.4#803005)