[
https://issues.apache.org/jira/browse/HIVE-20393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Deepak Jaiswal updated HIVE-20393:
----------------------------------
Status: Patch Available (was: In Progress)
> Semijoin Reduction : markSemiJoinForDPP behaves inconsistently
> --------------------------------------------------------------
>
> Key: HIVE-20393
> URL: https://issues.apache.org/jira/browse/HIVE-20393
> Project: Hive
> Issue Type: Bug
> Reporter: Deepak Jaiswal
> Assignee: Deepak Jaiswal
> Priority: Major
>
> markSemiJoinForDPP has multiple issues:
>
> * It uses the map tsOps, which is wrong because it disallows going through
> the same TS more than once, even though that TS may have filters from more
> than one semijoin edge. This results in inconsistent plans for the same
> query, as semijoin edges may be processed in a different order each time.
> * It uses getColumnExpr(), which is not as robust as extractColumn(), and
> this results in NPEs.
> * The logic that marks an edge as useful when an NPE is hit may end up with
> a bad edge.
> cc [~gopalv]
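
The first point above (a per-TS map making the outcome depend on edge order) can be illustrated with a toy model. This is only a sketch under stated assumptions, not Hive's code: the class `SemiJoinOrderSketch`, the `Edge` type, and both methods are hypothetical names invented for illustration, and the real markSemiJoinForDPP works on operator trees rather than strings.

```java
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy model only: all names here are hypothetical and do not come from Hive.
class SemiJoinOrderSketch {

    // Stand-in for a semijoin edge: its name and the table scan (TS) it filters.
    static final class Edge {
        final String name;
        final String targetTs;

        Edge(String name, String targetTs) {
            this.name = name;
            this.targetTs = targetTs;
        }
    }

    // Order-dependent variant: a TS is visited at most once, so only the
    // first edge that reaches a given TS is ever examined.
    static Set<String> examineOncePerTs(List<Edge> edges) {
        Map<String, String> tsOps = new HashMap<>();
        Set<String> examined = new TreeSet<>();
        for (Edge e : edges) {
            // putIfAbsent returns null only for the first edge seen on this TS.
            if (tsOps.putIfAbsent(e.targetTs, e.name) == null) {
                examined.add(e.name);
            }
        }
        return examined;
    }

    // Order-independent variant: every edge is examined, even when several
    // edges put filters on the same TS.
    static Set<String> examineEveryEdge(List<Edge> edges) {
        Set<String> examined = new TreeSet<>();
        for (Edge e : edges) {
            examined.add(e.name);
        }
        return examined;
    }
}
```

With two edges on the same TS, `examineOncePerTs` examines a different edge depending on which one comes first in the list, while `examineEveryEdge` returns the same set for either ordering.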