[
https://issues.apache.org/jira/browse/HIVE-12369?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt McCline updated HIVE-12369:
--------------------------------
Summary: Native Vector GroupBy (was: Native Vector GroupBy (Part 1))
> Native Vector GroupBy
> ---------------------
>
> Key: HIVE-12369
> URL: https://issues.apache.org/jira/browse/HIVE-12369
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Reporter: Matt McCline
> Assignee: Matt McCline
> Priority: Critical
> Attachments: HIVE-12369.01.patch, HIVE-12369.02.patch,
> HIVE-12369.05.patch, HIVE-12369.06.patch, HIVE-12369.091.patch,
> HIVE-12369.094.patch, HIVE-12369.095.patch, HIVE-12369.096.patch,
> HIVE-12369.097.patch, HIVE-12369.098.patch, HIVE-12369.099.patch,
> HIVE-12369.0991.patch, HIVE-12369.0992.patch, HIVE-12369.0993.patch
>
>
> Implement Native Vector GroupBy using fast hash table technology developed
> for Native Vector MapJoin, etc.
> Patch is currently limited to a single COUNT aggregation. Or, no aggregation
> also known as duplicate reduction.
> Here are examples of new kinds of classes introduces that stored the count in
> the slot table and don't allocate hash elements:
> {noformat}
> COUNT(column) VectorGroupByHashLongKeySingleCountColumnOperator
> COUNT(key) VectorGroupByHashLongKeySingleCountKeyOperator
> COUNT(*) VectorGroupByHashLongKeySingleCountStarOperator
> {noformat}
> And the duplicate reduction operator a single key. Example:
> {noformat}
> VectorGroupByHashLongKeyDuplicateReductionOperator
> {noformat}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)