[ https://issues.apache.org/jira/browse/SPARK-16669?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15421782#comment-15421782 ]
Apache Spark commented on SPARK-16669:
--------------------------------------
User 'Parth-Brahmbhatt' has created a pull request for this issue:
https://github.com/apache/spark/pull/14655
> Partition pruning for metastore relation size estimates for better join
> selection.
> ----------------------------------------------------------------------------------
>
> Key: SPARK-16669
> URL: https://issues.apache.org/jira/browse/SPARK-16669
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 2.0.0
> Reporter: Parth Brahmbhatt
>
> Currently, the metastore statistics return the size of the entire table, so
> the join selection strategy does not use a broadcast join even when only a
> single partition of a large table is selected. We should apply partition
> pruning when computing table-level statistics, so that only the sizes of the
> partitions relevant to the query are counted.
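
The effect described above can be illustrated with a small, self-contained sketch. It is a toy model in plain Python, not Spark code and not the change in the linked pull request: the Partition class and the table_size, pruned_size, and can_broadcast helpers are hypothetical names introduced here for illustration. The only Spark-specific value is the 10 MB default of spark.sql.autoBroadcastJoinThreshold, and the comparison against it is assumed to be a simple "size <= threshold" check.

```python
from dataclasses import dataclass

# Default value of spark.sql.autoBroadcastJoinThreshold (10 MB), in bytes.
BROADCAST_THRESHOLD_BYTES = 10 * 1024 * 1024


@dataclass
class Partition:
    """Hypothetical stand-in for a metastore partition and its size statistic."""
    spec: dict          # partition column values, e.g. {"day": 42}
    size_in_bytes: int  # size recorded for this partition


def table_size(partitions):
    """Size estimate without pruning: every partition is counted."""
    return sum(p.size_in_bytes for p in partitions)


def pruned_size(partitions, predicate):
    """Size estimate with pruning: only partitions matching the filter count."""
    return sum(p.size_in_bytes for p in partitions if predicate(p.spec))


def can_broadcast(size_in_bytes, threshold=BROADCAST_THRESHOLD_BYTES):
    """Assumed rule: a relation is broadcast if its estimate fits the threshold."""
    return size_in_bytes <= threshold


# A table with 365 daily partitions of 5 MB each.
partitions = [Partition({"day": d}, 5 * 1024 * 1024) for d in range(365)]

# Query filter that selects a single partition.
def one_day(spec):
    return spec["day"] == 42

whole = table_size(partitions)
selected = pruned_size(partitions, one_day)

print("whole table:", whole, "bytes, broadcast:", can_broadcast(whole))
print("pruned:", selected, "bytes, broadcast:", can_broadcast(selected))
```

With the whole-table estimate the relation is far above the threshold, while the pruned estimate for the single selected partition falls below it, which is the case the issue wants the planner to recognize.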