----------------------------------------------------------- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/30018/ -----------------------------------------------------------
Review request for hive. Bugs: HIVE-4809 https://issues.apache.org/jira/browse/HIVE-4809 Repository: hive-git Description ------- For example, we have a simple query like this ... {code:sql} SELECT x.a, x.b, count(x.b) OVER (PARTITION BY x.a) FROM src x; {\code} The plan of it is ... {code} STAGE DEPENDENCIES: Stage-1 is a root stage Stage-0 is a root stage STAGE PLANS: Stage: Stage-1 Map Reduce Alias -> Map Operator Tree: x TableScan alias: x Reduce Output Operator key expressions: expr: a type: int expr: a type: int sort order: ++ Map-reduce partition columns: expr: a type: int tag: -1 value expressions: expr: a type: int expr: b type: string Reduce Operator Tree: Extract PTF Operator Select Operator expressions: expr: _col0 type: int expr: _col1 type: string expr: _wcol0 type: bigint outputColumnNames: _col0, _col1, _col2 File Output Operator compressed: false GlobalTableId: 0 table: input format: org.apache.hadoop.mapred.TextInputFormat output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat Stage: Stage-0 Fetch Operator limit: -1 {\code} The ReduceSinkOperator has two "a" in its key columns. This redundancy can increase the size of map output. Diffs ----- ql/src/java/org/apache/hadoop/hive/ql/parse/PTFTranslator.java d1d866f ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java cea86df ql/src/test/results/clientpositive/correlationoptimizer12.q.out 015e021 ql/src/test/results/clientpositive/ctas_colname.q.out 079b981 ql/src/test/results/clientpositive/groupby_resolution.q.out 491593f ql/src/test/results/clientpositive/quotedid_basic.q.out 4384faa ql/src/test/results/clientpositive/spark/subquery_in.q.out d2269f3 ql/src/test/results/clientpositive/spark/vectorized_ptf.q.out 4f95e3f ql/src/test/results/clientpositive/subquery_in.q.out 35c8c0b ql/src/test/results/clientpositive/subquery_in_having.q.out 28087dd ql/src/test/results/clientpositive/subquery_notin.q.out 49fce1c ql/src/test/results/clientpositive/subquery_unqualcolumnrefs.q.out 24be8ec ql/src/test/results/clientpositive/tez/subquery_in.q.out 59a846d ql/src/test/results/clientpositive/tez/vectorized_ptf.q.out 2363b02 ql/src/test/results/clientpositive/union_remove_6_subq.q.out 63598f9 ql/src/test/results/clientpositive/vectorized_ptf.q.out 3d60d7f ql/src/test/results/clientpositive/windowing_streaming.q.out 6a55352 Diff: https://reviews.apache.org/r/30018/diff/ Testing ------- Thanks, Navis Ryu