Hi folks,

Currently, Aljoscha, Stephan, and I are reworking the DataStream API as
discussed before. Things are a bit in-flight right now with several commits
and pull requests, and the current master containing code from both the old
and the new API.

I want to give you an idea of how the new API will look like. This is a
very rough draft of the new documentation page (also a WiP):

https://www.dropbox.com/sh/t5nvlx7meadppnp/AAD5sEIH5S3QNYTiMsyE9KBva?dl=0

Compared to the current API, the major changes include:

- Different syntax (and implementation) for windows. Old constructs will be
replaced by the new ones. The new syntax resembles Google's Dataflow model,
but contains "shortcuts" as syntactic sugar for common cases

- Different syntax (and implementation) of "grouping". New terminology will
be KeyedDataStream (and "keyBy") which will replace GroupedDataStream.

- Reduced functionality in ConnectedDataStream - only map and flatMap

- New syntax (and implementation) for window joins, removal of cross

- No changes in iterations besides deleting the "long milliseconds" argument

- No changes in state

- Deletion of "DataSet.forward() and .global()"

- Windows can only come after keyBy, otherwise they are DOP-1 operators and
are defined as "windowAll"

Best,
Kostas

Reply via email to