Re: Question on NEXMark

2020-06-10 Thread Kenneth Knowles
It sounds like it could be something worth addressing. I don't really know the cost of this behavior. The pipeline is pretty easy to read. The pipeline itself does not explicitly manage any state, so it would be in the Flink execution of the GroupByKey primitive transform. The relevant code is

Re: Question on NEXMark

2020-06-10 Thread Andrew Pilloud
I think the author of this test is long gone, but the code originated inside google. This query is not part of the original Nexmark suite but was designed to exercise corner cases caused by out of order events, so that is what you are probably seeing. Here are relevant bits from the original

Question on NEXMark

2020-06-10 Thread Sruthi S Kumar
Hi, We are working on a Flink project and enhancing some state backend functionality. We are using NEXMark benchmark to compare different state backends performance of Flink. While running NEXMark queries using Flink runner of Beam we have noticed that there is quite a lot of non-existent read