[ https://issues.apache.org/jira/browse/SPARK-31894?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17168564#comment-17168564 ]
Apache Spark commented on SPARK-31894:
--------------------------------------
User 'xuanyuanking' has created a pull request for this issue:
https://github.com/apache/spark/pull/29315
> Introduce UnsafeRow format validation for streaming state store
> ---------------------------------------------------------------
>
> Key: SPARK-31894
> URL: https://issues.apache.org/jira/browse/SPARK-31894
> Project: Spark
> Issue Type: Bug
> Components: Structured Streaming
> Affects Versions: 3.1.0
> Reporter: Yuanjian Li
> Assignee: Yuanjian Li
> Priority: Major
> Fix For: 3.1.0
>
>
> Currently, Structured Streaming puts the UnsafeRow directly into the
> StateStore without any schema validation. This is dangerous when users reuse
> checkpoint files during migration: any change or bug fix related to an
> aggregate function may cause random exceptions or even wrong answers, e.g.
> SPARK-28067.
> Here we introduce UnsafeRow format validation for the state store.
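
To illustrate the kind of check the description above refers to, here is a minimal sketch in plain Python. It is not the code from the pull request. The names `validate_row` and `bitset_width` are hypothetical, and the sketch assumes a little-endian UnsafeRow layout: a null bitset of one 8-byte word per 64 fields, one 8-byte slot per field, and variable-length slots that pack the offset in the upper 32 bits and the size in the lower 32 bits.

```python
import struct
from typing import Sequence

WORD = 8  # the null bitset and every field slot are 8-byte words


def bitset_width(num_fields: int) -> int:
    """Bytes used by the null bitset: one 64-bit word per 64 fields."""
    return ((num_fields + 63) // 64) * WORD


def validate_row(row: bytes, variable_length: Sequence[bool]) -> bool:
    """Return True if `row` is structurally plausible for the expected schema.

    `variable_length[i]` says whether field i is variable-length (e.g. a
    string). Hypothetical helper, not a Spark API.
    """
    num_fields = len(variable_length)
    nulls_end = bitset_width(num_fields)
    fixed_end = nulls_end + num_fields * WORD
    if len(row) < fixed_end:
        return False  # too short to hold the bitset and all field slots
    null_bits = int.from_bytes(row[:nulls_end], "little")
    for i, is_var in enumerate(variable_length):
        if not is_var or (null_bits >> i) & 1:
            continue  # fixed-width and null fields are not checked here
        (word,) = struct.unpack_from("<Q", row, nulls_end + i * WORD)
        offset, size = word >> 32, word & 0xFFFFFFFF
        # Variable-length data must sit after the fixed region, inside the row.
        if offset < fixed_end or offset + size > len(row):
            return False
    return True


# Example: a row written with schema (long, string).
row = (
    bytes(8)                               # null bitset, no nulls
    + struct.pack("<q", 42)                # field 0: long value
    + struct.pack("<Q", (24 << 32) | 5)    # field 1: offset 24, size 5
    + b"hello".ljust(8, b"\x00")           # string data padded to 8 bytes
)
```

With this example row, `validate_row(row, [False, True])` accepts the schema the row was written with, while `validate_row(row, [True, True])` rejects it, because the long value in field 0 does not decode to a valid offset and size. A check of this shape can only catch layouts that are structurally impossible. Two schemas with the same fixed-width layout would still pass.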
--
This message was sent by Atlassian Jira
(v8.3.4#803005)