GuangMing Lu created HIVE-25269:
-----------------------------------

             Summary: When the skew and parallel parameters are true 
simultaneously, the result is less data
                 Key: HIVE-25269
                 URL: https://issues.apache.org/jira/browse/HIVE-25269
             Project: Hive
          Issue Type: Bug
          Components: Physical Optimizer, SQL
    Affects Versions: 3.1.2, 3.1.0
            Reporter: GuangMing Lu
         Attachments: table_a_data.orc, table_b_data.orc, table_c_data.orc, 
table_d_data.orc, test.sql

When the params of hive.optimize.skewjoin, hive.groupby.skewindata and 
hive.exec.parallel are true, and exec sql such as 'INSERT... FROM (SUBQUERY 
UNIONALL ...GROUP BY...) A JOIN/LEFT JOIN A.expression', result data will be 
reduced. Details of SQL and test data can be found in the attachment



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to