jrmccluskey commented on code in PR #32082:
URL: https://github.com/apache/beam/pull/32082#discussion_r1725290176


##########
sdks/python/apache_beam/transforms/util.py:
##########
@@ -802,6 +802,13 @@ class BatchElements(PTransform):
   corresponding to its contents. Each batch is emitted with a timestamp at
   the end of their window.
 
+  When the max_batch_duration_secs arg is provided, a stateful implementation
+  of BatchElements is used to batch elements across bundles. This is most
+  impactful in streaming applications where many bundles only contain one
+  element. Larger max_batch_duration_secs values will reduce the throughput of

Review Comment:
   Throughput is the right term, that batching can create a bottleneck. 
   
   The documentation at 
https://beam.apache.org/documentation/patterns/batch-elements/ should outline 
it more clearly as far as tuning, I think routing users there along with the 
new docstring content will help a lot



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@beam.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to