Github user hvanhovell commented on a diff in the pull request:
https://github.com/apache/spark/pull/19864#discussion_r156609849
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/columnar/InMemoryRelation.scala
---
@@ -60,7 +62,8 @@ case class InMemoryRelation(
@transient child: SparkPlan,
tableName: Option[String])(
@transient var _cachedColumnBuffers: RDD[CachedBatch] = null,
- val batchStats: LongAccumulator =
child.sqlContext.sparkContext.longAccumulator)
+ val batchStats: LongAccumulator =
child.sqlContext.sparkContext.longAccumulator,
+ statsOfPlanToCache: Option[Statistics] = None)
--- End diff --
Yeah, the secondary argument list seems a better place. I don't think we
should incorporate the stats in the hash/equals method.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]