[
https://issues.apache.org/jira/browse/HIVE-25026?focusedWorklogId=585094&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-585094
]
ASF GitHub Bot logged work on HIVE-25026:
-----------------------------------------
Author: ASF GitHub Bot
Created on: 19/Apr/21 12:10
Start Date: 19/Apr/21 12:10
Worklog Time Spent: 10m
Work Description: zhangheihei opened a new pull request #2189:
URL: https://github.com/apache/hive/pull/2189
**Hive task job will gen duplicate data cause of same task resubmission**
```
2021-04-05 06:05:52 CONSOLE# Number of reduce tasks is set to 0 since
there's no reduce operator
2021-04-05 06:05:52 CONSOLE# Launching Job 5 out of 4
2021-04-05 06:05:52 CONSOLE# Number of reduce tasks is set to 0 since
there's no reduce operator
```
<img
src="https://user-images.githubusercontent.com/13237066/115213523-2d945800-a134-11eb-94c3-52095c748283.png"
width="300" height="300">
For example, hive sql explain 4 task. when hive.exec.parallel=true and
task2/task3 is canExecuteInParallel,task4 will execute 2 times;
1. task1 is FINISHED, task2/task3 enter Runnable queue
<img
src="https://user-images.githubusercontent.com/13237066/115233371-65a69580-a14a-11eb-81fb-5a0c3582e3dc.png"
width="400" height="150">
2. task2/task3 is executed in parallel and ends at the same time. Now
task2/task3 is FINISHED
<img
src="https://user-images.githubusercontent.com/13237066/115233876-06955080-a14b-11eb-9570-7334eff8dcad.png"
width="400" height="150">
3. task2 removed from running queue, task4 will enter runnable queue
4.
4.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:
[email protected]
Issue Time Tracking
-------------------
Worklog Id: (was: 585094)
Remaining Estimate: 0h
Time Spent: 10m
> hive sql result is duplicate data cause of same task resubmission
> -----------------------------------------------------------------
>
> Key: HIVE-25026
> URL: https://issues.apache.org/jira/browse/HIVE-25026
> Project: Hive
> Issue Type: Bug
> Components: Hive
> Affects Versions: 3.1.1
> Reporter: hezhang
> Assignee: hezhang
> Priority: Critical
> Time Spent: 10m
> Remaining Estimate: 0h
>
> This issue is the same with hive-24577
--
This message was sent by Atlassian Jira
(v8.3.4#803005)