Oh, this is my mistake Yes correct way its use @Setup.
Thank you Eugene. 2016-11-18 22:54 GMT+04:00 Eugene Kirpichov <kirpic...@google.com.invalid>: > Hi Alexey, > > In general, things like establishing connections and initializing caches > are better done in @Setup and @TearDown methods, rather than @StartBundle > and @FinishBundle, because DoFn's can be reused between bundles and this > way you get more benefit from reuse. > > Bundles can be pretty small, especially in streaming pipelines. That said, > they normally shouldn't be 1-element-small. Hopefully someone working on > the Flink runner can comment. > > On Fri, Nov 18, 2016 at 10:47 AM amir bahmanyari > <amirto...@yahoo.com.invalid> wrote: > > > Hmmm...Thanks...This could very well be my bottleneck since I see tons of > > threads get on WAIT state after sometime& stay like that relatively > > forever.I have a 100 G worth of elements to process...........Is there a > > way to bypass this "startBundle" & get a fairly optimized > > behavior?Anyone? Thanks+regardsAmir- > > > > From: Demin Alexey <diomi...@gmail.com> > > To: dev@beam.incubator.apache.org; amir bahmanyari <amirto...@yahoo.com > > > > Sent: Friday, November 18, 2016 10:40 AM > > Subject: Re: Flink runner. Wrapper for DoFn > > > > Very simple example: > > > > My DoFn on startBundle load filters from remote db and build optimized > > index, on processElement apply filters on every element for decision > about > > push element to next operation or drop his. > > > > In current implementation it's like matching regexp on string, you have 2 > > way > > 1) compile regexp every time for every element > > 2) compile regexp one time and apply on all element > > > > now flink work by 1 way and this way not optimal > > > > > > 2016-11-18 22:26 GMT+04:00 amir bahmanyari <amirto...@yahoo.com.invalid > >: > > > > > Hi Alexey," startBundle can be expensive"...Could you elaborate on > > > "expensive" as per each element pls? > > > Thanks > > > > > > From: Demin Alexey <diomi...@gmail.com> > > > To: dev@beam.incubator.apache.org > > > Sent: Friday, November 18, 2016 7:40 AM > > > Subject: Flink runner. Wrapper for DoFn > > > > > > Hi > > > > > > In flink runner we have this code: > > > > > > https://github.com/apache/incubator-beam/blob/master/ > > > runners/flink/runner/src/main/java/org/apache/beam/runners/ > > > flink/translation/wrappers/streaming/DoFnOperator.java#L262 > > > > > > but in mostly cases method startBundle can be expensive for making for > > > every element (for example connection for db/build cache/ etc) > > > > > > Why so important invoke startBundle/finishBundle on every > > > incoming streamRecord ? > > > > > > Thanks > > > Alexey Diomin > > > > > > > > > > > > > > > > > > >