Zhihua Deng created HIVE-27494:
----------------------------------
Summary: Deduplicate the task results generated by multiple
branches in union all
Key: HIVE-27494
URL: https://issues.apache.org/jira/browse/HIVE-27494
Project: Hive
Issue Type: Bug
Reporter: Zhihua Deng
Assignee: Zhihua Deng
HIVE-23891 added the ability to deduplicate the task results under the
directory
<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_1,
but it does not take the same action on the sibling directory produced by the
same query:
<table-dir>/<staging-dir>/_tmp.-ext-10000/<dynamic-partition-dir>/HIVE_UNION_SUBDIR_2.
As a result, users may still hit the same data duplication problem when a Tez
task runs more than one attempt.
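To make the intended behavior concrete, here is a minimal sketch of deduplication applied to every union subdirectory rather than only the first. It is not Hive's actual implementation: the class name UnionSubdirDedup and the method dedupe are hypothetical, the file-name layout <taskId>_<attemptId> is assumed, and keeping the highest attempt number is an illustrative rule that may differ from the criterion Hive really uses.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, for illustration only; not part of Hive.
public class UnionSubdirDedup {
    // Assumed file-name layout: <taskId>_<attemptId>, e.g. 000000_1.
    private static final Pattern TASK_FILE = Pattern.compile("^(\\d+)_(\\d+)$");

    /**
     * Returns the paths to keep: at most one file per (directory, task id).
     * Because the directory is part of the key, files under
     * HIVE_UNION_SUBDIR_1 and HIVE_UNION_SUBDIR_2 are deduplicated
     * independently, and no subdirectory is skipped.
     */
    public static List<String> dedupe(List<String> paths) {
        Map<String, String> winners = new TreeMap<>();       // key -> path kept so far
        Map<String, Integer> bestAttempt = new HashMap<>();  // key -> highest attempt seen
        List<String> others = new ArrayList<>();             // files that do not match the layout

        for (String path : paths) {
            int slash = path.lastIndexOf('/');
            String dir = slash < 0 ? "" : path.substring(0, slash);
            String name = path.substring(slash + 1);

            Matcher m = TASK_FILE.matcher(name);
            if (!m.matches()) {
                others.add(path); // leave unrecognized files untouched
                continue;
            }

            String key = dir + "/" + m.group(1);
            int attempt = Integer.parseInt(m.group(2));
            Integer previous = bestAttempt.get(key);
            // Illustrative rule: the highest attempt number wins.
            if (previous == null || attempt > previous) {
                bestAttempt.put(key, attempt);
                winners.put(key, path);
            }
        }

        List<String> kept = new ArrayList<>(winners.values());
        kept.addAll(others);
        return kept;
    }
}
```

Given two attempts of task 000000 in each of HIVE_UNION_SUBDIR_1 and HIVE_UNION_SUBDIR_2, this sketch keeps one file per subdirectory, which is the outcome the issue asks for.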