[ https://issues.apache.org/jira/browse/HADOOP-4156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Namit Jain reassigned HADOOP-4156:
----------------------------------
Assignee: Namit Jain
> [hive] duplicate expression elimination for group by stage 1
> ------------------------------------------------------------
>
> Key: HADOOP-4156
> URL: https://issues.apache.org/jira/browse/HADOOP-4156
> Project: Hadoop Core
> Issue Type: Bug
> Reporter: Namit Jain
> Assignee: Namit Jain
>
> In the first job we evaluate all the input columns, all group by clause
> expressions, and the parameters to all the aggregation functions, but we
> do not eliminate all the duplicates because expression resolution and
> column resolution are treated differently.
> Consider the following table and query:
> src(key, value)
> select src.key, sum(src.value) from src group by src.key;
> Both src.key and src.value will be added twice: once from src's row
> resolver, and once more from the group by expression (src.key) or the
> aggregation parameter (src.value). This is currently necessary because the
> filter looks at (table, column) in the row resolver, whereas the group by
> expression looks at (", COLREF table column).
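The duplication described above can be illustrated with a small sketch. It is not Hive's code: the function names (stage1_outputs, canonical, deduplicate) are hypothetical, and the expression key shape ("", "COLREF table column") is an assumption made for illustration, since the exact key is not fully legible in the issue text. The idea is only that a column reached through two differently shaped keys is emitted twice unless both keys are mapped to one canonical form first.

```python
def stage1_outputs(table, columns, group_by, agg_params):
    """Collect stage-1 outputs using two different key shapes (hypothetical model)."""
    outputs = []
    # Column resolution: each input column is keyed as (table, column).
    for col in columns:
        outputs.append((table, col))
    # Expression resolution: group by expressions and aggregation parameters
    # are keyed as ("", expression text). This key shape is an assumption.
    for tab, col in group_by + agg_params:
        outputs.append(("", f"COLREF {tab} {col}"))
    return outputs


def canonical(key):
    """Map a plain column-reference expression back to its (table, column) key."""
    alias, name = key
    parts = name.split()
    if alias == "" and len(parts) == 3 and parts[0] == "COLREF":
        return (parts[1], parts[2])
    return key


def deduplicate(outputs):
    """Keep the first occurrence of each canonical key, preserving order."""
    seen = {}
    for key in outputs:
        seen.setdefault(canonical(key), None)
    return list(seen)


# select src.key, sum(src.value) from src group by src.key
outputs = stage1_outputs(
    "src",
    ["key", "value"],
    group_by=[("src", "key")],
    agg_params=[("src", "value")],
)
print(len(outputs))          # four entries before elimination
print(deduplicate(outputs))  # two entries after elimination
```

In this model the four stage-1 entries collapse to two once column-reference expressions are canonicalized. More complex expressions would keep their own keys and are left untouched.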
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.