[jira] [Work logged] (HIVE-24664) Support column aliases in Values clause

ASF GitHub Bot (Jira) Mon, 25 Jan 2021 20:32:14 -0800


     [ 
https://issues.apache.org/jira/browse/HIVE-24664?focusedWorklogId=541901&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-541901
 ]


ASF GitHub Bot logged work on HIVE-24664:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 26/Jan/21 04:31
            Start Date: 26/Jan/21 04:31
    Worklog Time Spent: 10m 
      Work Description: jcamachor commented on a change in pull request #1892:
URL: https://github.com/apache/hive/pull/1892#discussion_r564004991



##########
File path: 
itests/hive-blobstore/src/test/results/clientpositive/insert_into_dynamic_partitions.q.out
##########
@@ -100,7 +100,7 @@ STAGE PLANS:
             Statistics: Num rows: 1 Data size: 10 Basic stats: COMPLETE Column 
stats: COMPLETE
             GatherStats: false
             Select Operator
-              expressions: array(const struct(1,'101'),const 
struct(2,'202'),const struct(3,'303'),const struct(4,'404'),const 
struct(5,'505')) (type: array<struct<col1:int,col2:string>>)
+              expressions: 
array(named_struct('col1',1,'col2','101'),named_struct('col1',2,'col2','202'),named_struct('col1',3,'col2','303'),named_struct('col1',4,'col2','404'),named_struct('col1',5,'col2','505'))
 (type: array<struct<col1:int,col2:string>>)

Review comment:
       This should not change, i.e., it seems we are now failing to recognize 
constant structs. IIRC this is important because there are specific 
optimizations at execution time for some of these data structures (and also for 
translation to other engines). The consolidation could possibly be done in 
constant folding (`ConstantPropagateProcFactory`) if the changes in the parser 
need to be done this way.

##########
File path: 
itests/hive-blobstore/src/test/results/clientpositive/insert_into_dynamic_partitions.q.out
##########
@@ -100,7 +100,7 @@ STAGE PLANS:
             Statistics: Num rows: 1 Data size: 10 Basic stats: COMPLETE Column 
stats: COMPLETE
             GatherStats: false
             Select Operator
-              expressions: array(const struct(1,'101'),const 
struct(2,'202'),const struct(3,'303'),const struct(4,'404'),const 
struct(5,'505')) (type: array<struct<col1:int,col2:string>>)
+              expressions: 
array(named_struct('col1',1,'col2','101'),named_struct('col1',2,'col2','202'),named_struct('col1',3,'col2','303'),named_struct('col1',4,'col2','404'),named_struct('col1',5,'col2','505'))
 (type: array<struct<col1:int,col2:string>>)

Review comment:
       This should not change, i.e., it seems we are now failing to recognize 
constant structs. IIRC this is important because there are specific 
optimizations at execution time for some of these data structures (and also for 
translation to other engines). Could the consolidation possibly be done when we 
are creating the Calcite plan?

##########
File path: ql/src/test/results/clientpositive/llap/values.q.out
##########
@@ -10,7 +10,7 @@ POSTHOOK: Input: _dummy_database@_dummy_table
 #### A masked pattern was here ####
 Explain
 CBO PLAN:
-HiveTableFunctionScan(invocation=[inline(ARRAY(ROW(1, 2, 3), ROW(4, 5, 6)))], 
rowType=[RecordType(INTEGER col1, INTEGER col2, INTEGER col3)])

Review comment:
       Related to the previous comment: with that fix, this would not change 
(it is important that it is recognized as ROW).

##########
File path: parser/src/java/org/apache/hadoop/hive/ql/parse/IdentifiersParser.g
##########
@@ -151,11 +182,46 @@ expressionsNotInParenthesis[boolean isStruct, boolean 
forceStruct]
     -> {$more.tree}
     ;
 
-expressionPart[CommonTree t, boolean isStruct]
+expressionPart[CommonTree firstExprTree, boolean isStruct]
     :
     (COMMA expression)+
-    -> {isStruct}? ^(TOK_FUNCTION Identifier["struct"] {$t} expression+)
-    -> {$t} expression+
+    -> {isStruct}? ^(TOK_FUNCTION Identifier["struct"] {$firstExprTree} 
expression+)
+    -> {$firstExprTree} expression+
+    ;
+
+// Parses comma separated list of expressions with optionally specified 
aliases and store the aliases for further usage.
+// <expression> [<alias>] [, <expression> [<alias>]]
+firstExpressionsWithAlias
+@init { initAliases(); }
+    :
+    first=expression colAlias=identifier? (COMMA expressionWithAlias)*
+    -> {colAlias != null}? ^(TOK_FUNCTION Identifier["named_struct"] { 
adaptor.create(Identifier, addAlias($colAlias.tree.getText())) } {$first.tree} 
expressionWithAlias*)

Review comment:
       Could we maybe identify this construct with a specific token? Would that 
help to identify this case later, during plan generation?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 541901)
    Time Spent: 1h  (was: 50m)

> Support column aliases in Values clause
> ---------------------------------------
>
>                 Key: HIVE-24664
>                 URL: https://issues.apache.org/jira/browse/HIVE-24664
>             Project: Hive
>          Issue Type: Improvement
>            Reporter: Krisztian Kasa
>            Assignee: Krisztian Kasa
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h
>  Remaining Estimate: 0h
>
> Enable explicitly specifying column aliases in the first row of a Values clause. 
> If some columns have no alias specified, generate one for them.
> {code:java}
> values(1, 2 b, 3 c),(4, 5, 6);
> {code}
> {code:java}
> _col1   b   c
>   1     2   3
>   4     5   6
> {code}
>  This is not a standard SQL feature, but some database engines, such as Impala, 
> support it.
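
The alias rule in the description above (explicit aliases are kept, missing ones are generated) can be sketched roughly as follows. This is a minimal illustration, not the code from the pull request: the class `ValuesAliasSketch` and the method `resolveAliases` are hypothetical, and the generated-name scheme (`_col` plus the 1-based column position) is only an assumption inferred from the example output in the description.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class ValuesAliasSketch {
    // Hypothetical helper, not Hive code: takes the aliases written in the
    // first VALUES row (null where none was given) and fills in a generated
    // name for every column that has no explicit alias.
    static List<String> resolveAliases(List<String> firstRowAliases) {
        List<String> resolved = new ArrayList<>();
        for (int i = 0; i < firstRowAliases.size(); i++) {
            String alias = firstRowAliases.get(i);
            // Assumed naming scheme: "_col" + 1-based position, inferred
            // from the example output in the issue description.
            resolved.add(alias != null ? alias : "_col" + (i + 1));
        }
        return resolved;
    }

    public static void main(String[] args) {
        // values(1, 2 b, 3 c),(4, 5, 6): only columns 2 and 3 carry aliases.
        System.out.println(resolveAliases(Arrays.asList(null, "b", "c")));
    }
}
```

For the example in the description, the first column has no alias and gets a generated name, while `b` and `c` are kept as written. Only the first row is consulted; later rows such as `(4, 5, 6)` do not contribute aliases.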



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
