Re: Partitioning key range

2019-04-08 Thread Davood Rafiei
Hi all, Thanks a lot for the replies! On Mon, Apr 8, 2019 at 5:15 PM Ken Krugler wrote: > Hi Davood, > > We have done some explicit partitioning in the past, but it’s pretty > fragile. > > See FlinkUtils#makeKeyForOperatorIndex >

Re: Partitioning key range

2019-04-08 Thread Ken Krugler
Hi Davood, We have done some explicit partitioning in the past, but it’s pretty fragile. See FlinkUtils#makeKeyForOperatorIndex Though I haven’t tried this

Re: Partitioning key range

2019-04-08 Thread Fabian Hueske
Hi Davood, Flink uses hash partitioning to assign keys to key groups. Each key group is then assigned to a task for processing (a task might process multiple key groups). There is no way to directly assign a key to a particular key group or task. All you can do is to experiment with different

Re: Partitioning key range

2019-04-06 Thread Congxian Qiu
Hi Davood Maybe a custom KeySelector can be helpful, you can define the key used to partition the stream. You can ref the code[1] for detail. [1] 

Partitioning key range

2019-04-04 Thread Davood Rafiei
Hi all, I partition DataStream (say dsA) with parallelism 2 and get KeyedStream (say ksA) with parallelism 2. Depending on my keys in dsA, one partition remains empty in ksA. For example when my keys are 10 and 20 in dsA, then both partitions in ksA are full. However, with keys 1000 and 1001,