damccorm opened a new issue, #20813: URL: https://github.com/apache/beam/issues/20813
Originated from: https://stackoverflow.com/questions/66381608/apache-beam-does-not-trigger-early-repeatedly-for-session-windows-on-google-data The following pipeline fires early after each element when running locally using DirectRunner, but there are no early triggers when running on google cloud dataflow. On dataflow it triggers only after the session window has closed. ``` ( p | 'read' >> beam.io.ReadFromPubSub(subscription = 'projects/xxx/subscriptions/xxx-sub') | 'json' >> beam.Map(lambda x: json.loads(x.decode('utf-8'))) | 'kv' >> beam.Map(lambda x: (x['id'], x['amount'])) | 'window' >> beam.WindowInto(window.Sessions(15*60), trigger=trigger.Repeatedly(trigger.AfterCount(1)), accumulation_mode=AccumulationMode.ACCUMULATING) | 'group' >> beam.GroupByKey() | 'log' >> beam.Map(lambda x: logging.info(x)) ) ``` Apache Beam versions tried: 2.23 and 2.28. Imported from Jira [BEAM-11906](https://issues.apache.org/jira/browse/BEAM-11906). Original Jira may contain additional context. Reported by: ningk. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
