[ 
https://issues.apache.org/jira/browse/HIVE-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15861978#comment-15861978
 ] 

Xuefu Zhang commented on HIVE-15489:
------------------------------------

{quote}
I've thought about this. The downside is many good cases will be turned to 
reduce join as well. But I think this config is mainly for stability, so it 
should be fine, as long as we document this well. Will add to next patch.
{quote}
My concern is that the map joins down below may also suffer the consequence of 
inaccurate stats.

{quote}
Do you think we should combine these two? since they are similar.
{quote}
It's probably better to have two as they control behaviors on different 
functionality.

> Alternatively use table scan stats for HoS
> ------------------------------------------
>
>                 Key: HIVE-15489
>                 URL: https://issues.apache.org/jira/browse/HIVE-15489
>             Project: Hive
>          Issue Type: Improvement
>          Components: Spark, Statistics
>    Affects Versions: 2.2.0
>            Reporter: Chao Sun
>            Assignee: Chao Sun
>         Attachments: HIVE-15489.1.patch, HIVE-15489.2.patch, 
> HIVE-15489.3.patch, HIVE-15489.4.patch, HIVE-15489.wip.patch
>
>
> For MapJoin in HoS, we should provide an option to only use stats in the TS 
> rather than the populated stats in each of the join branch. This could be 
> pretty conservative but more reliable.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to