Deepak Jaiswal created HIVE-20393: ------------------------------------- Summary: Semijoin Reduction : markSemiJoinForDPP behaves inconsistently Key: HIVE-20393 URL: https://issues.apache.org/jira/browse/HIVE-20393 Project: Hive Issue Type: Bug Reporter: Deepak Jaiswal Assignee: Deepak Jaiswal
markSemiJoinForDPP has multiple issues, * Uses map tsOps which is wrong as it disallows going thru same TS which may have filters from more than 1 semijoin edges. This results in inconsistent plans for same query as semijoin edges may be processed in different order each time. * Uses getColumnExpr() which is not as robust as extractColumn() thus resulting in NPEs. * The logic to mark an edge useful when NPE is hit may end up having bad edge. cc [~gopalv] -- This message was sent by Atlassian JIRA (v7.6.3#76005)