Suprith created HIVE-24915:
------------------------------

             Summary: Distribute by with sort by clause when used with constant 
as the first parameter for sort produces wrong result.
                 Key: HIVE-24915
                 URL: https://issues.apache.org/jira/browse/HIVE-24915
             Project: Hive
          Issue Type: Bug
    Affects Versions: 2.3.4
            Reporter: Suprith
            Assignee: Suprith


Distribute by with sort by clause when used with constant as the first 
parameter for sort produces wrong result.

Example: 


{code:java}
 SELECT 
    t.time,
    'a' as const
  FROM
    (SELECT 1591819264 as time
    UNION ALL
    SELECT 1591819265 as time) t
  DISTRIBUTE by const
  sort by const, t.time
{code}

Produces
 
|{color:#000000}*time*{color}|{color:#000000}*const*{color}|
| NULL|{color:#000000}a{color}|
| NULL|{color:#000000}a{color}|


Instead it should produce:
|{color:#000000}*time*{color}|{color:#000000}*const*{color}|
|{color:#000000}*1591819264*{color}|{color:#000000}a{color}|
|{color:#000000}*1591819265*{color}|{color:#000000}a{color}|





Incorrect sort columns are used while creating ReduceSink 
https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java#L9066




 

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to