[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: P10IDS_RISKLIST.zip
p10ids_riskcon.zip
p10ids_realpayrc_ygz.zip
p10ids_prerec_split_ygz.zip
comb_classcode.zip

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: P10IDS_RISKLIST.zip, comb_classcode.zip, 
> p10ids_prerec_split_ygz.zip, p10ids_realpayrc_ygz.zip, p10ids_riskcon.zip, 
> test.sql
>
>
> When hive.optimize.skewjoin, hive.groupby.skewindata and hive.exec.parallel
> are all set to true, executing SQL of the form 'INSERT ... FROM (SUBQUERY
> UNION ALL ... GROUP BY ...) A JOIN/LEFT JOIN A.expression' writes fewer rows
> than expected. The SQL and the test data can be found in the attachments.
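
For readers without the attachments, the query shape described above might look roughly like the following HiveQL. This is only a sketch: the three SET properties come from the report, but the table names (target_t, src1, src2, dim) and the columns (k, v, cnt) are hypothetical placeholders, not taken from test.sql or the attached data.

```sql
-- The three settings the report says must all be true to trigger the issue.
SET hive.optimize.skewjoin=true;
SET hive.groupby.skewindata=true;
SET hive.exec.parallel=true;

-- Hypothetical tables and columns; only the overall shape follows the report:
-- INSERT ... FROM (subquery with UNION ALL ... GROUP BY ...) a LEFT JOIN ...
INSERT OVERWRITE TABLE target_t
SELECT a.k, a.cnt, b.v
FROM (
  SELECT u.k, COUNT(*) AS cnt
  FROM (
    SELECT k FROM src1
    UNION ALL
    SELECT k FROM src2
  ) u
  GROUP BY u.k
) a
LEFT JOIN dim b ON a.k = b.k;
```

One way to check for the reported symptom, under the same assumptions, would be to run the statement once with the three properties set to false and once with them set to true, then compare SELECT COUNT(*) FROM target_t between the two runs.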



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: test.sql

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: test.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: comb_classcode.data)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: 样例分析-表入数据.sql)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: 样例分析-表入数据.sql

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: table_b_data.orc)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: test.sql)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: table_d_data.orc)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: comb_classcode.data

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: table_c_data.orc)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql


[jira] [Updated] (HIVE-25269) When the skew and parallel parameters are true simultaneously, the result is less data

2021-06-21 Thread GuangMing Lu (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-25269?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

GuangMing Lu updated HIVE-25269:

Attachment: (was: table_a_data.orc)

> When the skew and parallel parameters are true simultaneously, the result is 
> less data
> --
>
> Key: HIVE-25269
> URL: https://issues.apache.org/jira/browse/HIVE-25269
> Project: Hive
> Issue Type: Bug
> Components: Physical Optimizer, SQL
> Affects Versions: 3.1.0, 3.1.2
> Reporter: GuangMing Lu
> Priority: Major
> Attachments: comb_classcode.data, 样例分析-表入数据.sql