[
https://issues.apache.org/jira/browse/HIVE-15489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15861978#comment-15861978
]
Xuefu Zhang commented on HIVE-15489:
------------------------------------
{quote}
I've thought about this. The downside is many good cases will be turned to
reduce join as well. But I think this config is mainly for stability, so it
should be fine, as long as we document this well. Will add to next patch.
{quote}
My concern is that the map joins down below may also suffer the consequence of
inaccurate stats.
{quote}
Do you think we should combine these two? since they are similar.
{quote}
It's probably better to have two as they control behaviors on different
functionality.
> Alternatively use table scan stats for HoS
> ------------------------------------------
>
> Key: HIVE-15489
> URL: https://issues.apache.org/jira/browse/HIVE-15489
> Project: Hive
> Issue Type: Improvement
> Components: Spark, Statistics
> Affects Versions: 2.2.0
> Reporter: Chao Sun
> Assignee: Chao Sun
> Attachments: HIVE-15489.1.patch, HIVE-15489.2.patch,
> HIVE-15489.3.patch, HIVE-15489.4.patch, HIVE-15489.wip.patch
>
>
> For MapJoin in HoS, we should provide an option to only use stats in the TS
> rather than the populated stats in each of the join branch. This could be
> pretty conservative but more reliable.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)