[ https://issues.apache.org/jira/browse/HIVE-14246?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15380178#comment-15380178 ]
Gunther Hagleitner commented on HIVE-14246: ------------------------------------------- +1 > Tez: disable auto-reducer parallelism when CUSTOM_EDGE is in place > ------------------------------------------------------------------ > > Key: HIVE-14246 > URL: https://issues.apache.org/jira/browse/HIVE-14246 > Project: Hive > Issue Type: Bug > Components: Tez > Affects Versions: 2.2.0 > Reporter: Gopal V > Assignee: Gopal V > Priority: Minor > Fix For: 2.2.0 > > Attachments: HIVE-14246.1.patch > > > The CUSTOM_SIMPLE_EDGE impl has differences between the size constraints of > either edge which cannot be represented by the ShuffleVertexManager presently. > Reducing the width based on the hashtable build side vs the streaming probe > side have different consequences since there is no order of runtime between > them. > Until the two parent vertices of the shuffle hash-join are related, this > feature causes massive inconsistency of performance across runs. > For inner & semi joins, the hashtable side should have a higher priority than > the streaming side and for left outer joins, the streaming side can over-take > the hashtable side, being the more dominant factor in the final row-counts. > Until such priorities can be bubbled up into ShuffleVertexManager, this > feature can be disabled. -- This message was sent by Atlassian JIRA (v6.3.4#6332)