[ 
https://issues.apache.org/jira/browse/SPARK-15365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15287067#comment-15287067
 ] 

Apache Spark commented on SPARK-15365:
--------------------------------------

User 'Parth-Brahmbhatt' has created a pull request for this issue:
https://github.com/apache/spark/pull/13150

> Metastore relation should fallback to HDFS size if statistics are not 
> available from table meta data.
> -----------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-15365
>                 URL: https://issues.apache.org/jira/browse/SPARK-15365
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>            Reporter: Parth Brahmbhatt
>
> Currently if a table is used in join operation we rely on Metastore returned 
> size to calculate if we can convert the operation to Broadcast join. This 
> optimization only kicks in for table's that have the statics available in 
> metastore. Hive generally rolls over to HDFS if the statistics are not 
> available directly from metastore and this seems like a reasonable choice to 
> adopt given the optimization benefit of using broadcast joins.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to