damccorm opened a new issue, #20813:
URL: https://github.com/apache/beam/issues/20813

   Originated from: 
https://stackoverflow.com/questions/66381608/apache-beam-does-not-trigger-early-repeatedly-for-session-windows-on-google-data
   
   The following pipeline fires early after each element when run locally 
with the DirectRunner, but produces no early firings when run on Google 
Cloud Dataflow. On Dataflow, it fires only after the session window has 
closed.
   
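   ```
   import json
   import logging

   import apache_beam as beam
   from apache_beam.transforms import trigger, window
   from apache_beam.transforms.trigger import AccumulationMode

   # `p` is a streaming beam.Pipeline created elsewhere (not shown in the report).
   ( p
       | 'read'   >> beam.io.ReadFromPubSub(
             subscription='projects/xxx/subscriptions/xxx-sub')
       | 'json'   >> beam.Map(lambda x: json.loads(x.decode('utf-8')))
       | 'kv'     >> beam.Map(lambda x: (x['id'], x['amount']))
       # 15-minute session gap; fire a pane after every element.
       | 'window' >> beam.WindowInto(
             window.Sessions(15*60),
             trigger=trigger.Repeatedly(trigger.AfterCount(1)),
             accumulation_mode=AccumulationMode.ACCUMULATING)
       | 'group'  >> beam.GroupByKey()
       | 'log'    >> beam.Map(lambda x: logging.info(x))
   )
   ```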
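
   To make the expected behaviour concrete, here is a minimal sketch in plain Python, with no Beam dependency, of the output the reporter sees on the DirectRunner: one pane per arriving element, each carrying everything accumulated so far in that key's session. It is a simplified model, not Beam's implementation. The function name `simulate_early_firings` and the `(key, value, timestamp)` event tuples are hypothetical, and it assumes elements are processed one at a time in arrival order, with sessions merging only when their intervals strictly overlap.

```python
from collections import defaultdict

GAP = 15 * 60  # session gap in seconds, mirroring window.Sessions(15*60)


def simulate_early_firings(events, gap=GAP):
    """Model Repeatedly(AfterCount(1)) with ACCUMULATING mode on sessions.

    events: iterable of (key, value, timestamp_seconds) in arrival order.
    Returns one pane per element as (key, (start, end), values_so_far).
    """
    sessions = defaultdict(list)  # key -> list of (start, end, values)
    panes = []
    for key, value, ts in events:
        # Each element starts in its own proto-session [ts, ts + gap).
        new_start, new_end = ts, ts + gap
        start, end, values = new_start, new_end, []
        untouched = []
        for s_start, s_end, s_values in sessions[key]:
            if s_start < new_end and new_start < s_end:
                # Overlapping sessions merge; accumulated values are kept.
                start = min(start, s_start)
                end = max(end, s_end)
                values.extend(s_values)
            else:
                untouched.append((s_start, s_end, s_values))
        values.append(value)
        untouched.append((start, end, values))
        sessions[key] = untouched
        # AfterCount(1), repeated: emit a pane for every element.
        panes.append((key, (start, end), list(values)))
    return panes


# Hypothetical sample input: (id, amount, event time in seconds).
sample = [("a", 10, 0), ("a", 20, 60), ("b", 5, 100), ("a", 30, 2000)]
for pane in simulate_early_firings(sample):
    print(pane)
```

   In this model, the second element for key `a` yields a pane containing both amounts while the session is still open. That is the early firing the report says is missing on Dataflow.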
   
   
   Apache Beam versions tried: 2.23 and 2.28.
   
   Imported from Jira 
[BEAM-11906](https://issues.apache.org/jira/browse/BEAM-11906). Original Jira 
may contain additional context.
   Reported by: ningk.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
