-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/30018/
-----------------------------------------------------------

Review request for hive.


Bugs: HIVE-4809
    https://issues.apache.org/jira/browse/HIVE-4809


Repository: hive-git


Description
-------

For example, we have a simple query like this ...
{code:sql}
SELECT x.a, x.b, count(x.b) OVER (PARTITION BY x.a) FROM src x;
{\code}

The plan of it is ...
{code}
STAGE DEPENDENCIES:
  Stage-1 is a root stage
  Stage-0 is a root stage

STAGE PLANS:
  Stage: Stage-1
    Map Reduce
      Alias -> Map Operator Tree:
        x 
          TableScan
            alias: x
            Reduce Output Operator
              key expressions:
                    expr: a
                    type: int
                    expr: a
                    type: int
              sort order: ++
              Map-reduce partition columns:
                    expr: a
                    type: int
              tag: -1
              value expressions:
                    expr: a
                    type: int
                    expr: b
                    type: string
      Reduce Operator Tree:
        Extract
          PTF Operator
            Select Operator
              expressions:
                    expr: _col0
                    type: int
                    expr: _col1
                    type: string
                    expr: _wcol0
                    type: bigint
              outputColumnNames: _col0, _col1, _col2
              File Output Operator
                compressed: false
                GlobalTableId: 0
                table:
                    input format: org.apache.hadoop.mapred.TextInputFormat
                    output format: 
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat

  Stage: Stage-0
    Fetch Operator
      limit: -1
{\code}

The ReduceSinkOperator has two "a" in its key columns. This redundancy can 
increase the size of map output.


Diffs
-----

  ql/src/java/org/apache/hadoop/hive/ql/parse/PTFTranslator.java d1d866f 
  ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java cea86df 
  ql/src/test/results/clientpositive/correlationoptimizer12.q.out 015e021 
  ql/src/test/results/clientpositive/ctas_colname.q.out 079b981 
  ql/src/test/results/clientpositive/groupby_resolution.q.out 491593f 
  ql/src/test/results/clientpositive/quotedid_basic.q.out 4384faa 
  ql/src/test/results/clientpositive/spark/subquery_in.q.out d2269f3 
  ql/src/test/results/clientpositive/spark/vectorized_ptf.q.out 4f95e3f 
  ql/src/test/results/clientpositive/subquery_in.q.out 35c8c0b 
  ql/src/test/results/clientpositive/subquery_in_having.q.out 28087dd 
  ql/src/test/results/clientpositive/subquery_notin.q.out 49fce1c 
  ql/src/test/results/clientpositive/subquery_unqualcolumnrefs.q.out 24be8ec 
  ql/src/test/results/clientpositive/tez/subquery_in.q.out 59a846d 
  ql/src/test/results/clientpositive/tez/vectorized_ptf.q.out 2363b02 
  ql/src/test/results/clientpositive/union_remove_6_subq.q.out 63598f9 
  ql/src/test/results/clientpositive/vectorized_ptf.q.out 3d60d7f 
  ql/src/test/results/clientpositive/windowing_streaming.q.out 6a55352 

Diff: https://reviews.apache.org/r/30018/diff/


Testing
-------


Thanks,

Navis Ryu

Reply via email to