Maintaining Stream Partitioning after Mapping?

Ryan Conway Wed, 12 Apr 2017 12:37:47 -0700

Greetings,

Is there a means of maintaining a stream's partitioning after running it
through an operation such as map or filter?


I have a pipeline stage S that operates on a stream partitioned by an ID
field. S flat maps objects of type A to type B, which both have an "ID"
field, and where each instance of B that S outputs has the same ID as its
input instance of A. I hope to add a pipeline stage T immediately after S
that operates using the same partitioning as S, so that I can avoid the
expense of re-keying the instances of type B.

If I am understanding the DataStream API correctly this is not feasible
with Flink, as map(), filter() etc. all output SingleOutputStreamOperator.
But I am hoping that I am missing something.

Thank you,
Ryan

Maintaining Stream Partitioning after Mapping?

Reply via email to