Re: [PR] [SPARK-50017] Support Avro encoding for TransformWithState operator [spark]

via GitHub Fri, 08 Nov 2024 16:40:53 -0800


anishshri-db commented on code in PR #48401:
URL: https://github.com/apache/spark/pull/48401#discussion_r1835178324



##########
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingSymmetricHashJoinHelper.scala:
##########
@@ -303,6 +303,56 @@ object StreamingSymmetricHashJoinHelper extends Logging {
     }
   }
 
+  /**
+   * A custom RDD that allows partitions to be "zipped" together, while 
ensuring the tasks'
+   * preferred location is based on which executors have the required join 
state stores already
+   * loaded. This class is a variant of 
[[org.apache.spark.rdd.ZippedPartitionsRDD2]] which only
+   * changes signature of `f` by taking in a map of column family schemas. 
This is used for
+   * passing the column family schemas when there is initial state for the 
TransformWithStateExec
+   * operator
+   */
+  class StateStoreAwareZipPartitionsRDDWithSchemas[A: ClassTag, B: ClassTag, 
V: ClassTag](

Review Comment:
   lets remove these ?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] [SPARK-50017] Support Avro encoding for TransformWithState operator [spark]

Reply via email to