anishshri-db commented on code in PR #47104:
URL: https://github.com/apache/spark/pull/47104#discussion_r1662987074
##########
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/IncrementalExecution.scala:
##########
@@ -187,23 +187,33 @@ class IncrementalExecution(
}
}
- object WriteStatefulOperatorMetadataRule extends SparkPlanPartialRule {
+ // Planning rule used to record the state schema for the first run and
validate state schema
+ // changes across query runs.
+ object StateSchemaAndOperatorMetadataRule extends SparkPlanPartialRule {
override val rule: PartialFunction[SparkPlan, SparkPlan] = {
+ // In the case of TransformWithStateExec, we want to collect this
StateSchema
+ // filepath, and write this path out in the OperatorStateMetadata file
case stateStoreWriter: StateStoreWriter if isFirstBatch =>
+ val stateSchemaVersion = stateStoreWriter match {
+ case tws: TransformWithStateExec => tws.stateSchemaVersion
Review Comment:
This would not make it possible to evolve this in the future right ? if we
use the hard coded value ?
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]