[
https://issues.apache.org/jira/browse/SPARK-27653?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17259806#comment-17259806
]
Michael Park commented on SPARK-27653:
--------------------------------------
The max_by() and min_by() functions appear to be missing from functions.scala
and functions.py. Was that intentional or just an oversight?
> Add max_by() / min_by() SQL aggregate functions
> -----------------------------------------------
>
> Key: SPARK-27653
> URL: https://issues.apache.org/jira/browse/SPARK-27653
> Project: Spark
> Issue Type: New Feature
> Components: SQL
> Affects Versions: 3.0.0
> Reporter: Josh Rosen
> Assignee: L. C. Hsieh
> Priority: Major
> Fix For: 3.0.0
>
>
> It would be useful if Spark SQL supported the {{max_by()}} SQL aggregate
> function. Quoting from the [Presto
> docs|https://prestodb.github.io/docs/current/functions/aggregate.html#max_by]:
> {quote}max_by(x, y) → [same as x]
> Returns the value of x associated with the maximum value of y over all input
> values.
> {quote}
> {{min_by}} works similarly.
> Technically, I can emulate this behavior using window functions, but the
> resulting syntax is much more verbose and less intuitive than
> {{max_by}} / {{min_by}}.
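
For readers unfamiliar with the semantics quoted above, here is a minimal plain-Python sketch of what max_by / min_by compute over (x, y) pairs. It does not use Spark and is not Spark's implementation. The sample rows are made up for illustration, and the handling of ties and empty input simply follows Python's built-in max() and min(), which is an assumption of this sketch and says nothing about how Spark behaves in those cases.

```python
from typing import Iterable, Tuple, TypeVar

X = TypeVar("X")
Y = TypeVar("Y")


def max_by(rows: Iterable[Tuple[X, Y]]) -> X:
    """Return the x value paired with the largest y value."""
    # max() compares rows by their y component only.
    best_x, _ = max(rows, key=lambda row: row[1])
    return best_x


def min_by(rows: Iterable[Tuple[X, Y]]) -> X:
    """Return the x value paired with the smallest y value."""
    best_x, _ = min(rows, key=lambda row: row[1])
    return best_x


# Hypothetical sample data: (course, earnings) pairs.
rows = [("Java", 20000), ("dotNET", 5000), ("Scala", 48000)]

print(max_by(rows))  # course with the highest earnings
print(min_by(rows))  # course with the lowest earnings
```

With the sample rows, max_by picks the course whose earnings are highest and min_by the one whose earnings are lowest. An empty input raises ValueError in this sketch.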