If we come up with a general approach in the context of the Flink runner,
perhaps that piece can go back to the "runner-core" component and be
adopted more widely.

On Wed, Apr 20, 2016 at 8:13 AM, Kenneth Knowles <[email protected]>
wrote:

> Hi Aljoscha,
>
> Great idea.
>
>  - The logic for matching up the windows is WindowFn#getSideInputWindow [1]
>  - The SDK used to have something along the lines of what you describe [2]
> but we thought it was too runner-specific, directly referencing Dataflow
> details, and with a particular model of buffering + timer. Perhaps it is a
> starting place for your design?
>
> Kenn
>
> [1]
>
> https://github.com/apache/incubator-beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/windowing/WindowFn.java#L131
>
> [2]
>
> https://github.com/GoogleCloudPlatform/DataflowJavaSDK/blob/1e5524a7f5d0d774488cb0206ea6433085461775/sdk/src/main/java/com/google/cloud/dataflow/sdk/runners/worker/StreamingSideInputDoFnRunner.java
>
> On Wed, Apr 20, 2016 at 4:25 AM, Jean-Baptiste Onofré <[email protected]>
> wrote:
>
> > Hi Aljoscha
> >
> > AFAIR, the Runner API Proposal document (from Kenneth) contains some
> > points about side input.
> >
> >
> >
> https://drive.google.com/folderview?id=0B-IhJZh9Ab52OFBVZHpsNjc4eXc&usp=sharing
> >
> > I don't think it goes into the details of side inputs and windows, but
> > definitely the document we should extend.
> >
> > Regards
> > JB
> >
> >
> >
> > On 04/20/2016 11:55 AM, Aljoscha Krettek wrote:
> >
> >> Hi,
> >> for https://issues.apache.org/jira/browse/BEAM-102 we will need to have
> >> some functionality that deals with side inputs and windows (of both the
> >> main input and the side inputs) and how they get matched and how we wait
> >> for windows (blocking). I imagine that we could add some component that
> is
> >> similar to ReduceFnRunner but for side inputs: We would just instantiate
> >> it
> >> with a factory for state storage, then push elements into it while
> >> processing and it would provide a way to get a SideInputReader.
> >>
> >> I think this would not be specific to the Flink runner because other
> >> runner
> >> implementors will face similar problems. Are there any ideas/design docs
> >> about such a thing already? If not, we should probably start designing.
> >>
> >> What do you think?
> >>
> >> Cheers,
> >> Aljoscha
> >>
> >>
> > --
> > Jean-Baptiste Onofré
> > [email protected]
> > http://blog.nanthrax.net
> > Talend - http://www.talend.com
> >
>

Reply via email to