[
https://issues.apache.org/jira/browse/FLINK-4017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15342547#comment-15342547
]
ASF GitHub Bot commented on FLINK-4017:
---------------------------------------
Github user zentol commented on a diff in the pull request:
https://github.com/apache/flink/pull/2115#discussion_r67940565
--- Diff:
flink-libraries/flink-python/src/main/python/org/apache/flink/python/api/flink/plan/DataSet.py
---
@@ -192,6 +193,30 @@ def reduce(self, operator):
self._env._sets.append(child)
return child_set
+ def aggregate(self, aggregation, field):
+ """
+ Applies an Aggregate transformation (using a GroupReduceFunction)
on a non-grouped Tuple DataSet.
+ :param aggregation: The built-in aggregation function to apply on
the DataSet.
+ :param field: The index of the Tuple field on which to perform the
function.
+ :return: A GroupReduceOperator that represents the aggregated
DataSet.
+ """
+ child_set = self.reduce_group(aggregation(field), combinable=True)
--- End diff --
there should also be a test case for a non-grouped aggregation.
> [py] Add Aggregation support to Python API
> ------------------------------------------
>
> Key: FLINK-4017
> URL: https://issues.apache.org/jira/browse/FLINK-4017
> Project: Flink
> Issue Type: Improvement
> Components: Python API
> Reporter: Geoffrey Mon
> Priority: Minor
>
> Aggregations are not currently supported in the Python API.
> I was getting started with setting up and working with Flink and figured this
> would be a relatively simple task for me to get started with. Currently
> working on this at https://github.com/geofbot/flink
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)