GuangMing Lu created HIVE-25269:
-----------------------------------

             Summary: When the skew and parallel parameters are true simultaneously, the result is less data
                 Key: HIVE-25269
                 URL: https://issues.apache.org/jira/browse/HIVE-25269
             Project: Hive
          Issue Type: Bug
          Components: Physical Optimizer, SQL
    Affects Versions: 3.1.2, 3.1.0
            Reporter: GuangMing Lu
         Attachments: table_a_data.orc, table_b_data.orc, table_c_data.orc, table_d_data.orc, test.sql

When hive.optimize.skewjoin, hive.groupby.skewindata and hive.exec.parallel are all set to true, executing SQL of the form 'INSERT ... FROM (SUBQUERY UNION ALL ... GROUP BY ...) A JOIN/LEFT JOIN A.expression' produces fewer result rows than expected.

The full SQL and the test data can be found in the attachments.
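
For readers without access to the attachments, the following is a minimal sketch of the query shape described above. It is not the attached test.sql: the table names (target_t, src_1, src_2, dim_t) and columns (k, cnt, v) are hypothetical placeholders, and only the three SET properties are taken from the report.

```sql
-- The three settings the report says must all be true to trigger the issue
SET hive.optimize.skewjoin=true;
SET hive.groupby.skewindata=true;
SET hive.exec.parallel=true;

-- Hypothetical tables and columns, for illustration only;
-- the actual statement and data are in the attached test.sql and .orc files
INSERT OVERWRITE TABLE target_t
SELECT a.k, a.cnt, d.v
FROM (
  -- subquery: UNION ALL followed by GROUP BY
  SELECT u.k AS k, COUNT(*) AS cnt
  FROM (
    SELECT k FROM src_1
    UNION ALL
    SELECT k FROM src_2
  ) u
  GROUP BY u.k
) a
-- join on a column of the aggregated subquery A
LEFT JOIN dim_t d ON a.k = d.k;
```

Comparing the row count of target_t with all three settings enabled against a run with them disabled would be one way to check whether rows are lost, assuming the sketch matches the attached case closely enough to reproduce it.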