[
https://issues.apache.org/jira/browse/HIVE-15808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Deepak Jaiswal updated HIVE-15808:
----------------------------------
Description: If a semijoin branch is on the same operator pipeline as a
hash join, then by design it is on the big-table side, which is not optimal.
The operator cycle detection logic may not find a cycle, because there is no
cycle at the operator level. However, once Tez builds its tasks, there can be
a cycle at the task level, causing the query to fail. (was: It is found that
the current logic of cycle detection does not find cycles created when there
is a semijoin branch parallel to a hash join on a reducer.
To avoid such cycles, remove the semijoin reduction optimization.)
Summary: Remove semijoin reduction branch if it is on bigtable along
with hash join (was: Remove Semijoin reduction branch on reducers if there is
hash join)
> Remove semijoin reduction branch if it is on bigtable along with hash join
> --------------------------------------------------------------------------
>
> Key: HIVE-15808
> URL: https://issues.apache.org/jira/browse/HIVE-15808
> Project: Hive
> Issue Type: Bug
> Reporter: Deepak Jaiswal
> Assignee: Deepak Jaiswal
> Attachments: HIVE-15808.patch
>
>
> If a semijoin branch is on the same operator pipeline as a hash join, then
> by design it is on the big-table side, which is not optimal. The operator
> cycle detection logic may not find a cycle, because there is no cycle at the
> operator level. However, once Tez builds its tasks, there can be a cycle at
> the task level, causing the query to fail.
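
The operator-level versus task-level distinction in the description can be
illustrated with a small, self-contained sketch. This is not Hive or Tez code.
The operator names (TS_big, SEL_big, MAPJOIN, GBY_sj, RS_sj, TS_small,
RS_small), the task names (Map1, Map2), and the operator-to-task assignment
are all hypothetical, chosen only to show how an acyclic operator graph can
collapse into a cyclic task graph once operators are grouped into tasks.

```python
from collections import defaultdict


def has_cycle(edges):
    """Return True if the directed graph given as (src, dst) pairs has a cycle."""
    graph = defaultdict(list)
    nodes = set()
    for src, dst in edges:
        graph[src].append(dst)
        nodes.update((src, dst))

    WHITE, GRAY, BLACK = 0, 1, 2  # unvisited, on the DFS stack, finished
    color = {n: WHITE for n in nodes}

    def visit(n):
        color[n] = GRAY
        for m in graph[n]:
            if color[m] == GRAY:  # back edge: m is still on the stack
                return True
            if color[m] == WHITE and visit(m):
                return True
        color[n] = BLACK
        return False

    return any(color[n] == WHITE and visit(n) for n in sorted(nodes))


def task_edges(op_edges, task_of):
    """Project operator edges onto tasks, keeping only cross-task edges."""
    return {
        (task_of[s], task_of[d])
        for s, d in op_edges
        if task_of[s] != task_of[d]
    }


# Hypothetical plan: the big-table pipeline contains the hash join (MAPJOIN),
# and a semijoin branch (GBY_sj -> RS_sj) leaves the same pipeline to filter
# the small-table scan.
op_edges = [
    ("TS_big", "SEL_big"),
    ("SEL_big", "MAPJOIN"),    # big-table side of the hash join
    ("SEL_big", "GBY_sj"),     # semijoin branch starts on the big-table side
    ("GBY_sj", "RS_sj"),
    ("RS_sj", "TS_small"),     # semijoin filter feeds the small-table scan
    ("TS_small", "RS_small"),
    ("RS_small", "MAPJOIN"),   # small table is sent to the hash join
]

# Hypothetical grouping of operators into tasks.
task_of = {
    "TS_big": "Map1", "SEL_big": "Map1", "MAPJOIN": "Map1",
    "GBY_sj": "Map1", "RS_sj": "Map1",
    "TS_small": "Map2", "RS_small": "Map2",
}

operator_cycle = has_cycle(op_edges)
task_cycle = has_cycle(task_edges(op_edges, task_of))
print(operator_cycle, task_cycle)
```

In this toy plan, Map1 must send the semijoin filter to Map2 before Map2 can
scan, while Map2 must send the small table to Map1 before the hash join can
run, so the two tasks wait on each other even though no operator depends on
itself.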