Column pruning after join+group-by query
----------------------------------------

                 Key: HIVE-284
                 URL: https://issues.apache.org/jira/browse/HIVE-284
             Project: Hadoop Hive
          Issue Type: Improvement
    Affects Versions: 0.2.0, 0.3.0
            Reporter: Zheng Shao


The query is:

explain INSERT OVERWRITE TABLE t
    SELECT t1.r, t2.c, sum(t1.v * t2.v)
    FROM t1 join t2 on t1.c = t2.r
GROUP BY t1.r, t2.c;

The FileSinkOperator after the join is serializing all 6 columns from the 2 
tables (both have 3 columns: r, c, v) instead of the 4 that is needed later.


-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to