Pablo Langa Blanco created SPARK-34961:
------------------------------------------
Summary: Migrate First function from DeclarativeAggregate to
TypedImperativeAggregate to improve performance
Key: SPARK-34961
URL: https://issues.apache.org/jira/browse/SPARK-34961
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 3.2.0
Reporter: Pablo Langa Blanco
The main objective of this change is to improve aggregation performance in
some cases.
There are three possibilities when Spark plans an aggregation. In the first
case, with mutable primitive types, HashAggregate is used.
When these types are not used, there are two options. If the function
implements TypedImperativeAggregate, ObjectHashAggregate is used. Otherwise,
Spark falls back to SortAggregate, which is less efficient.
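The three cases above can be sketched as a small decision function. This is a
simplified model written in plain Python, not Spark's planner code: the
AggFunction class, its fields, and choose_aggregate_operator are hypothetical
names introduced only for illustration, and the "at least one function" rule
in the second case is an assumption about how a multi-function aggregation is
treated.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AggFunction:
    """Hypothetical stand-in for an aggregate function in a query plan."""
    name: str
    buffer_is_mutable_primitive: bool  # e.g. a long or double buffer
    is_typed_imperative: bool          # implements TypedImperativeAggregate


def choose_aggregate_operator(functions: List[AggFunction]) -> str:
    """Pick a physical operator following the three cases in the description."""
    # Case 1: every aggregation buffer uses mutable primitive types.
    if all(f.buffer_is_mutable_primitive for f in functions):
        return "HashAggregate"
    # Case 2 (assumption): at least one function is a TypedImperativeAggregate.
    if any(f.is_typed_imperative for f in functions):
        return "ObjectHashAggregate"
    # Case 3: fall back to the sort-based operator.
    return "SortAggregate"


# A sum over longs, and First over a non-primitive type before and after
# the proposed migration.
sum_long = AggFunction("sum", True, False)
first_before = AggFunction("first", False, False)
first_after = AggFunction("first", False, True)

print(choose_aggregate_operator([sum_long]))      # HashAggregate
print(choose_aggregate_operator([first_before]))  # SortAggregate
print(choose_aggregate_operator([first_after]))   # ObjectHashAggregate
```

In this model, the only thing the migration changes is the is_typed_imperative
flag, which is enough to move First off the sort-based path.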
In this PR I propose migrating the First function to implement
TypedImperativeAggregate so that it can take advantage of this feature
(ObjectHashAggregateExec).
This Jira is related to SPARK-34464.
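To illustrate what a First aggregate looks like when written in the typed
imperative style, here is a minimal sketch in plain Python. It is not the code
from the PR: FirstBuffer and FirstAggregate are hypothetical names, the method
names only mirror the create-buffer / update / merge / eval shape of
TypedImperativeAggregate, and the serialize and deserialize steps are left out.

```python
from dataclasses import dataclass
from typing import Any


@dataclass
class FirstBuffer:
    """Hypothetical aggregation buffer: the first value seen, if any."""
    value: Any = None
    is_set: bool = False


class FirstAggregate:
    """Illustrative model of First as an object-buffer aggregate."""

    def __init__(self, ignore_nulls: bool = False):
        self.ignore_nulls = ignore_nulls

    def create_aggregation_buffer(self) -> FirstBuffer:
        return FirstBuffer()

    def update(self, buffer: FirstBuffer, value: Any) -> FirstBuffer:
        # Keep only the first value; optionally skip nulls (None here).
        if not buffer.is_set and not (self.ignore_nulls and value is None):
            buffer.value = value
            buffer.is_set = True
        return buffer

    def merge(self, buffer: FirstBuffer, other: FirstBuffer) -> FirstBuffer:
        # The left buffer wins if it already holds a value.
        if not buffer.is_set and other.is_set:
            buffer.value = other.value
            buffer.is_set = True
        return buffer

    def eval(self, buffer: FirstBuffer) -> Any:
        return buffer.value


agg = FirstAggregate(ignore_nulls=True)
left = agg.create_aggregation_buffer()
for v in [None, "a", "b"]:
    agg.update(left, v)
right = agg.create_aggregation_buffer()
agg.update(right, "c")
print(agg.eval(agg.merge(left, right)))  # a
```

Because the buffer is an ordinary object rather than a fixed set of primitive
fields, an aggregate of this shape is the kind that ObjectHashAggregate is
meant to handle.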