Github user shaoxuan-wang commented on the issue:

    https://github.com/apache/flink/pull/3423
  
    @fhueske  thanks for the review. I completely agree with your suggestion on 
 "reworking the batch design". Actually I have proposed the same idea to 
"rework the batch" before your review (maybe you have missed my comment on 
Feb.27). I was hesitating to make the changes, as I want to keep this PR as 
dedicated as possible. But since the performance of the current design is a 
concern, let's do the clean up all together within this PR.  Regarding to 
"pairwise merging", I have different opinion, there are many aggregates that 
merging a list at once is much more efficient than merge just two. If we always 
use "pairwise merging" in the runtime, we will lose the advantage of 
merge(List) API defined in AggregateFunction interface. If the memory is a 
concern, we can limit the size of List for each merge iteration, I will provide 
the update very soon.


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

Reply via email to