Hi folks, Currently, Aljoscha, Stephan, and I are reworking the DataStream API as discussed before. Things are a bit in-flight right now with several commits and pull requests, and the current master containing code from both the old and the new API.
I want to give you an idea of how the new API will look like. This is a very rough draft of the new documentation page (also a WiP): https://www.dropbox.com/sh/t5nvlx7meadppnp/AAD5sEIH5S3QNYTiMsyE9KBva?dl=0 Compared to the current API, the major changes include: - Different syntax (and implementation) for windows. Old constructs will be replaced by the new ones. The new syntax resembles Google's Dataflow model, but contains "shortcuts" as syntactic sugar for common cases - Different syntax (and implementation) of "grouping". New terminology will be KeyedDataStream (and "keyBy") which will replace GroupedDataStream. - Reduced functionality in ConnectedDataStream - only map and flatMap - New syntax (and implementation) for window joins, removal of cross - No changes in iterations besides deleting the "long milliseconds" argument - No changes in state - Deletion of "DataSet.forward() and .global()" - Windows can only come after keyBy, otherwise they are DOP-1 operators and are defined as "windowAll" Best, Kostas