Re: Add partitionedKeyBy to DataStream

2016-10-20 Thread Till Rohrmann
Hi Xiaowei, I like the idea to reuse a partitioning and thus saving a shuffle operation. It would be great if we could fail at runtime in case the partitioning changed somehow. That way a logical user failure won't go unnoticed. Would it make sense to name the method partitionedByKey(...)

Add partitionedKeyBy to DataStream

2016-10-20 Thread Xiaowei Jiang
After we do any interesting operations (e.g. reduce) on KeyedStream, the result becomes DataStream. In a lot of cases, the output still has the same or compatible keys with the KeyedStream (logically). But to do further operations on these keys, we are forced to use keyby again. This works

[jira] [Created] (FLINK-4855) Add partitionedKeyBy to DataStream

2016-10-18 Thread Xiaowei Jiang (JIRA)
Xiaowei Jiang created FLINK-4855: Summary: Add partitionedKeyBy to DataStream Key: FLINK-4855 URL: https://issues.apache.org/jira/browse/FLINK-4855 Project: Flink Issue Type: Improvement