Hi Shen,

It is completely up to the runner how to divide elements into bundles. A bundle is one item of work that should fail or succeed atomically. Bundling limits parallelism, but does not determine it. For example, a streaming execution may have many bundles over time as elements arrive, regardless of parallelism.
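
To make the atomicity point concrete, here is a toy sketch in plain Python. It does not use the Beam SDK, and it is not how any particular runner works: `split_into_bundles`, `process_bundle`, and `run` are hypothetical names, and fixed-size chunking is only an assumption standing in for whatever the runner decides.

```python
from typing import Callable, List, Sequence


def split_into_bundles(elements: Sequence[int], bundle_size: int) -> List[List[int]]:
    """Toy stand-in for the runner's bundling decision (assumed: fixed-size chunks)."""
    return [list(elements[i:i + bundle_size])
            for i in range(0, len(elements), bundle_size)]


def process_bundle(bundle: List[int], fn: Callable[[int], int]) -> List[int]:
    """Process one bundle; outputs are only staged here, never committed."""
    staged = []
    for element in bundle:
        # An exception here propagates and throws away everything staged so far.
        staged.append(fn(element))
    return staged


def run(elements: Sequence[int], bundle_size: int,
        fn: Callable[[int], int], max_attempts: int = 3) -> List[int]:
    """Commit each bundle's outputs all at once, retrying the whole bundle on failure."""
    committed: List[int] = []
    for bundle in split_into_bundles(elements, bundle_size):
        for attempt in range(max_attempts):
            try:
                committed.extend(process_bundle(bundle, fn))
                break  # bundle succeeded as a unit
            except Exception:
                if attempt == max_attempts - 1:
                    raise  # give up after the last attempt


    return committed


# Usage: element 4 fails once, so its whole bundle [3, 4] is processed again.
calls: List[int] = []
failed_once = set()


def flaky_double(x: int) -> int:
    calls.append(x)
    if x == 4 and x not in failed_once:
        failed_once.add(x)
        raise RuntimeError("transient failure")
    return 2 * x


result = run([1, 2, 3, 4, 5], bundle_size=2, fn=flaky_double)
```

In this sketch element 3 is processed twice even though it never failed, because it shares a bundle with the element that did. User code has no say in `bundle_size` here, which mirrors the point that the division is the runner's choice.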

Kenn

On Fri, Jun 24, 2016 at 12:13 PM, Shen Li <[email protected]> wrote:
> Hi,
>
> The document says "when a ParDo transform is executed, the elements of the
> input PCollection are first divided up into some number of bundles".
>
> How do users control the number of bundles/parallelism? Or is it completely
> up to the runner?
>
> Thanks,
>
> Shen
