[
https://issues.apache.org/jira/browse/HIVE-28489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17894391#comment-17894391
]
Seonggon Namgung commented on HIVE-28489:
-----------------------------------------
[~zabetak] , PR#5424 is ready for review.
And I changed the file format. Please let me know if there remain any issues
with the slides.
> Partitioning the input data of Grouping Set GroupBy operator
> ------------------------------------------------------------
>
> Key: HIVE-28489
> URL: https://issues.apache.org/jira/browse/HIVE-28489
> Project: Hive
> Issue Type: New Feature
> Reporter: Seonggon Namgung
> Assignee: Seonggon Namgung
> Priority: Major
> Labels: pull-request-available
> Attachments: 2.PartitionDataBeforeGroupingSet.pdf
>
>
> GroupBy operator with grouping sets often emits too many rows, which becomes
> the bottleneck of query execution. To reduce the number output rows, this
> JIRA proposes partitioning the input data of such GroupBy operator.
> Please check out the attached slides for detailed explanation.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)