Suprith created HIVE-24915:
------------------------------
Summary: Distribute by with sort by clause when used with constant
as the first parameter for sort produces wrong result.
Key: HIVE-24915
URL: https://issues.apache.org/jira/browse/HIVE-24915
Project: Hive
Issue Type: Bug
Affects Versions: 2.3.4
Reporter: Suprith
Assignee: Suprith
Distribute by with sort by clause when used with constant as the first
parameter for sort produces wrong result.
Example:
{code:java}
SELECT
t.time,
'a' as const
FROM
(SELECT 1591819264 as time
UNION ALL
SELECT 1591819265 as time) t
DISTRIBUTE by const
sort by const, t.time
{code}
Produces
|{color:#000000}*time*{color}|{color:#000000}*const*{color}|
| NULL|{color:#000000}a{color}|
| NULL|{color:#000000}a{color}|
Instead it should produce:
|{color:#000000}*time*{color}|{color:#000000}*const*{color}|
|{color:#000000}*1591819264*{color}|{color:#000000}a{color}|
|{color:#000000}*1591819265*{color}|{color:#000000}a{color}|
Incorrect sort columns are used while creating ReduceSink
https://github.com/apache/hive/blob/master/ql/src/java/org/apache/hadoop/hive/ql/parse/SemanticAnalyzer.java#L9066
--
This message was sent by Atlassian Jira
(v8.3.4#803005)