Hi Thomas, Does it mean future Pipeline implementations would allow applications to set the runner after a pipeline has been constructed?
Thanks, Shen On Tue, Aug 15, 2017 at 12:36 PM, Thomas Groh <tg...@google.com.invalid> wrote: > This style of method doesn't fit with the current approach of pipeline > construction, where the PipelineRunner need not be specified until the > pipeline is run; as such, the runner can't observe the construction of the > Pipeline, as it may not exist during the construction of the Pipeline. > > On Tue, Aug 15, 2017 at 8:41 AM, Eugene Kirpichov < > kirpic...@google.com.invalid> wrote: > > > ... And remember it and make available inside PCollection (which > > application produced this collection). > > > > On Tue, Aug 15, 2017, 8:39 AM Eugene Kirpichov <kirpic...@google.com> > > wrote: > > > > > In general, no - but the implementation of PAssertionSite exemplifies > the > > > approach. I guess it could be useful to make this a general beam > feature > > > and remember it for all transforms. It would probably be best to > > implement > > > inside Pipeline.apply(). > > > > > > On Tue, Aug 15, 2017, 7:02 AM Shen Li <cs.she...@gmail.com> wrote: > > > > > >> Hi Eugene, > > >> > > >> Thanks for sharing the info. That PAssertionSite tracks where an > > assertion > > >> error occurred. Do you know if it is possible to get the class name > and > > >> line number where a PTransform was added? > > >> > > >> Thanks, > > >> Shen > > >> > > >> On Mon, Aug 14, 2017 at 10:54 PM, Eugene Kirpichov < > > >> kirpic...@google.com.invalid> wrote: > > >> > > >> > Hi Shen, > > >> > Responding just to one part of your message - "remember the line at > > >> which > > >> > the PTransform was added": take a look at > > >> > https://github.com/apache/beam/pull/2247 which does this for > PAssert. > > >> > > > >> > On Mon, Aug 14, 2017 at 7:32 PM Shen Li <cs.she...@gmail.com> > wrote: > > >> > > > >> > > In 0.5.0 or earlier releases, PipelineRunner provides an > > >> > > apply(PTransform<InputT, OutputT>, InputT) method which allows > > runner > > >> > > implementation to perform actions when the PTransform is added to > > the > > >> > > pipeline. In later releases, that apply(...) method has been > > replaced > > >> by > > >> > a > > >> > > Pipeline#replaceAll() method, where the runner can only get > involved > > >> > after > > >> > > the pipeline has been fully constructed. In terms of override > > >> > PTransforms, > > >> > > these two APIs are identical. But, the early API could still be > > >> helpful. > > >> > > For example, the runner could remember the line at which the > > >> PTransform > > >> > was > > >> > > added, and provide that info to users to assist debugging. Is it > > >> possible > > >> > > to add that API back? Or is there any other way to involve the > > runner > > >> > when > > >> > > a PTransform is added? > > >> > > > > >> > > Thanks, > > >> > > Shen > > >> > > > > >> > > > >> > > > > > >