[GitHub] spark pull request #19864: [SPARK-22673][SQL] InMemoryRelation should utiliz...

hvanhovell Wed, 13 Dec 2017 02:01:43 -0800

Github user hvanhovell commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19864#discussion_r156609849
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/InMemoryRelation.scala
 ---
    @@ -60,7 +62,8 @@ case class InMemoryRelation(
         @transient child: SparkPlan,
         tableName: Option[String])(
         @transient var _cachedColumnBuffers: RDD[CachedBatch] = null,
    -    val batchStats: LongAccumulator = 
child.sqlContext.sparkContext.longAccumulator)
    +    val batchStats: LongAccumulator = 
child.sqlContext.sparkContext.longAccumulator,
    +    statsOfPlanToCache: Option[Statistics] = None)
    --- End diff --
    
    Yeah, the secondary argument list seems a better place. I don't think we 
should incorporate the stats in the hash/equals method.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #19864: [SPARK-22673][SQL] InMemoryRelation should utiliz...

Reply via email to