Re: key_by problem in Pyflink

2021-07-16 Thread Fei Zhao
Hi again, It seems the temporary solution I mentioned in last mail doesn't work steadily. I am wondering that whether all states should be accessed in `process_element1`, `process_element2` and `on_timer` (if a timer is registered) or not ? Or is there any other suggested workaround? Thanks. Fe

Re: key_by problem in Pyflink

2021-07-13 Thread Fei Zhao
Hi, Thanks for your explanation! Adding a line `self.data.contains('xxx')` in the `process_element2` and all goes well. I will take this as my temporary solution. Looking forward to the next release. Best Regards, Fei Xingbo Huang 于2021年7月13日周二 下午4:18写道: > Hi, > > I have created the JIRA[1]

Re: key_by problem in Pyflink

2021-07-13 Thread Xingbo Huang
Hi, I have created the JIRA[1] to fix this bug which will be included in release-1.13.2. The root cause is the wrong mapping of the state key to the state. This kind of wrong mapping occurs when the key is switched, but the state is not used. As you wrote in the example, the `data` you declared is

Re: key_by problem in Pyflink

2021-07-12 Thread 赵飞
Thanks. In addition, I run the program in a local mini cluster mode, not sure if it would affect the results. Xingbo Huang 于2021年7月12日周一 下午9:02写道: > Hi, > > I think your understanding is correct. The results seem a little wired. > I'm looking into this and will let you know when there are any fi

Re: key_by problem in Pyflink

2021-07-12 Thread Xingbo Huang
Hi, I think your understanding is correct. The results seem a little wired. I'm looking into this and will let you know when there are any findings. Best, Xingbo 赵飞 于2021年7月12日周一 下午4:48写道: > Hi all, > I'm using pyflink to develop a module, whose main functionality is > processing user data bas