Hi Abdul, Please see https://stackoverflow.com/questions/45985753/what-is-the-difference-between-dofn-setup-and-dofn-startbundle - let me know if it answers your question sufficiently.
On Mon, May 21, 2018 at 7:04 PM Abdul Qadeer <[email protected]> wrote: > Hi! > > I was trying to understand the behavior of StartBundle and FinishBundle > w.r.t. DoFns. > I have an unbounded data source and I am trying to leverage bundling to > achieve batching. > From the docs of ParDo: > > "when a ParDo transform is executed, the elements of the input PCollection > are first divided up into some number of "bundles" > > I would like to know if bundling is possible for unbounded data in the > first place. If it is then how do I control the bundle size i.e. number of > elements of a given PCollection in that bundle? >
