Davies Liu created SPARK-11425:
----------------------------------
Summary: Improve hybrid aggregation (sort-based after hash-based)
Key: SPARK-11425
URL: https://issues.apache.org/jira/browse/SPARK-11425
Project: Spark
Issue Type: Improvement
Reporter: Davies Liu
After aggregation, the dataset could be smaller than inputs, so it's better to
do hash based aggregation for all inputs, then using sort based aggregation to
merge them.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]