Hya, I started an answer on the stackoverflow question on this.
https://stackoverflow.com/questions/58041925/apache-beam-what-are-the-key-concepts-for-writing-efficient-data-processing-pi/58054759#58054759 Cheers Reza On Sat, 21 Sep 2019 at 15:35, dev wearebold <[email protected]> wrote: > Hello, > > I've been using Beam for a few months now and I'd like to know if there > are any best practices/materials that explain how to write efficient and > optimized Beam pipelines. > > For example using a CombinePerKey/ReducePerKey/SumPerKey instead of a > simple GroupByKey to reduce shuffling and latency. > > > Any tips are welcome as I want to learn a lot (: > > > Have a nice day, > > J. -- This email may be confidential and privileged. If you received this communication by mistake, please don't forward it to anyone else, please erase all copies and attachments, and please let me know that it has gone to the wrong person. The above terms reflect a potential business arrangement, are provided solely as a basis for further discussion, and are not intended to be and do not constitute a legally binding obligation. No legally binding obligations will be created, implied, or inferred until an agreement in final form is executed in writing by all parties involved.
