[ https://issues.apache.org/jira/browse/SPARK-5791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14321777#comment-14321777 ]
Yi Zhou edited comment on SPARK-5791 at 2/15/15 1:49 AM:
---------------------------------------------------------
For the same input data set size (e.g., 1 TB), the query takes about 2 minutes
on Hive on M/R with optimization parameters, but about 1 hour on Spark SQL.
was (Author: jameszhouyi):
For the same input data set size, the query takes about 2 minutes on Hive on
M/R with optimization parameters, but about 1 hour on Spark SQL.
> [Spark SQL] show poor performance when multiple table do join operation
> -----------------------------------------------------------------------
>
> Key: SPARK-5791
> URL: https://issues.apache.org/jira/browse/SPARK-5791
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.2.0
> Reporter: Yi Zhou
>
> Spark SQL shows poor performance when joining multiple tables.