[GitHub] spark pull request #21560: [SPARK-24386][SS] coalesce(1) aggregates in conti...

jose-torres Tue, 26 Jun 2018 17:42:44 -0700

Github user jose-torres commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21560#discussion_r198337615
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/continuous/ContinuousDataSourceRDD.scala
 ---
    @@ -51,7 +51,7 @@ class ContinuousDataSourceRDD(
         sc: SparkContext,
         dataQueueSize: Int,
         epochPollIntervalMs: Long,
    -    @transient private val readerFactories: Seq[InputPartition[UnsafeRow]])
    +    private val readerFactories: Seq[InputPartition[UnsafeRow]])
    --- End diff --
    
    We list the partitions when computing the coalesce RDD. Should we instead 
be packing the partitions into the partitions of the coalesce RDD? I'd assumed 
it was valid to expect that rdd.partitions would work on executors, but maybe 
it's not.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark pull request #21560: [SPARK-24386][SS] coalesce(1) aggregates in conti...

Reply via email to