HyukjinKwon commented on a change in pull request #22758: [SPARK-25332][SQL] 
Instead of broadcast hash join ,Sort merge join node is added in the plan for 
the join queries executed in a new spark session/context
URL: https://github.com/apache/spark/pull/22758#discussion_r265827554
 
 

 ##########
 File path: 
sql/hive/src/main/scala/org/apache/spark/sql/hive/HiveMetastoreCatalog.scala
 ##########
 @@ -193,6 +193,16 @@ private[hive] class HiveMetastoreCatalog(sparkSession: 
SparkSession) extends Log
           None)
         val logicalRelation = cached.getOrElse {
           val updatedTable = inferIfNeeded(relation, options, fileFormat)
+          // Intialize the catalogTable stats if its not defined.An intial 
value has to be defined
+          // so that the hive statistics will be updated after each insert 
command.
+          val withStats = {
+            if (updatedTable.stats == None) {
 
 Review comment:
   @wangyum, so it is basically subset of #22721? It's funny that Hive tables 
should set the initial stats alone here, which is supposed to be set somewhere 
else.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to