[
https://issues.apache.org/jira/browse/SPARK-15365?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Spark reassigned SPARK-15365:
------------------------------------
Assignee: (was: Apache Spark)
> Metastore relation should fallback to HDFS size if statistics are not
> available from table meta data.
> -----------------------------------------------------------------------------------------------------
>
> Key: SPARK-15365
> URL: https://issues.apache.org/jira/browse/SPARK-15365
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Reporter: Parth Brahmbhatt
>
> Currently if a table is used in join operation we rely on Metastore returned
> size to calculate if we can convert the operation to Broadcast join. This
> optimization only kicks in for table's that have the statics available in
> metastore. Hive generally rolls over to HDFS if the statistics are not
> available directly from metastore and this seems like a reasonable choice to
> adopt given the optimization benefit of using broadcast joins.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]