[
https://issues.apache.org/jira/browse/FLINK-5768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15889775#comment-15889775
]
ASF GitHub Bot commented on FLINK-5768:
---------------------------------------
Github user fhueske commented on the issue:
https://github.com/apache/flink/pull/3423
Hi @shaoxuan-wang, sorry. I missed your comment. If you haven't started
reworking the batch aggregations yet, I agree to do it later and just change
the merging to smaller batches.
So far, we always put an emphasis on robustness and tried to avoid memory
issues as much as possible. In batch, most of the JVM memory is maintained by
Flink and not available for regular user-function objects. So I'd suggest to be
a bit conservative here and batch 16 rows. We can also run a few benchmarks to
check how much the parameter affects the performance.
> Apply new aggregation functions for datastream and dataset tables
> -----------------------------------------------------------------
>
> Key: FLINK-5768
> URL: https://issues.apache.org/jira/browse/FLINK-5768
> Project: Flink
> Issue Type: Sub-task
> Components: Table API & SQL
> Reporter: Shaoxuan Wang
> Assignee: Shaoxuan Wang
>
> Apply new aggregation functions for datastream and dataset tables
> This includes:
> 1. Change the implementation of the DataStream aggregation runtime code to
> use new aggregation functions and aggregate dataStream API.
> 2. DataStream will be always running in incremental mode, as explained in
> 06/Feb/2017 in FLINK5564.
> 2. Change the implementation of the Dataset aggregation runtime code to use
> new aggregation functions.
> 3. Clean up unused class and method.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)