[
https://issues.apache.org/jira/browse/HIVE-26968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Stamatis Zampetakis resolved HIVE-26968.
----------------------------------------
Fix Version/s: 4.0.0
Resolution: Fixed
Fixed in
https://github.com/apache/hive/commit/136aff9396cd53abe8b1b6b73856776db1316a88.
Thanks a lot for the PR [~seonggon]!
> Wrong results when shared work optimizer merges TS operator with different
> DPP edges
> ------------------------------------------------------------------------------------
>
> Key: HIVE-26968
> URL: https://issues.apache.org/jira/browse/HIVE-26968
> Project: Hive
> Issue Type: Sub-task
> Affects Versions: 4.0.0-alpha-2
> Reporter: Seonggon Namgung
> Assignee: Seonggon Namgung
> Priority: Critical
> Labels: hive-4.0.0-must, pull-request-available
> Fix For: 4.0.0
>
> Attachments: TPC-DS Query64 OperatorGraph.pdf
>
> Time Spent: 50m
> Remaining Estimate: 0h
>
> SharedWorkOptimizer merges TableScan operators that have different DPP
> parents, which leads to the creation of semantically wrong query plan.
> In our environment, running TPC-DS query64 on 1TB Iceberg format table
> returns no rows because of this problem. (The correct result has 7094 rows.)
> We use hive.optimize.shared.work=true,
> hive.optimize.shared.work.extended=true, and
> hive.optimize.shared.work.dppunion=false to reproduce the bug.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)