Sorry, some links are wrong. Here I fixed it to the correct link:

   -

   Address some unexpected query results (see the Appendix
   
<https://docs.google.com/document/d/1uWRMbN927cRXhSm5oeV3pbwb6o73am4r1ckEJDhAHa0/edit#heading=h.ehd1najwmeos>
of
   the design document for examples).
   -

   Reduce the likelihood of RocksDB corruptions caused by checkpointing a
   new version based on a previous version that is not the official version.
   The first section
   
<https://docs.google.com/document/d/1uWRMbN927cRXhSm5oeV3pbwb6o73am4r1ckEJDhAHa0/edit#heading=h.z2wsnxxpksge>
of
   the documentation explains how this can happen.

The other links are correct.

Thanks,

Siying

On Tue, Aug 27, 2024 at 7:21 PM Siying Dong <siying.d...@databricks.com>
wrote:

> Hi,
>
> We are planning to work on a feature that improves how the RocksDB State
> Store organizes checkpoint versions in cloud storage. This improvement aims
> to:
>
>    -
>
>    Address some unexpected query results (see the Appendix
>    
> <https://docs.google.com/document/d/1uWRMbN927cRXhSm5oeV3pbwb6o73am4r1ckEJDhAHa0/edit#heading=h.ehd1najwmeos>
>    of the design document for examples).
>    -
>
>    Reduce the likelihood of RocksDB corruptions caused by checkpointing a
>    new version based on a previous version that is not the official version.
>    The first section
>    
> <https://docs.google.com/document/d/1pT5bwW325VndVH09aZd7TykKXbBP5uCrNIQOC0uDoLc/edit#heading=h.z2wsnxxpksge>
>    of the documentation explains how this can happen.
>
> The core idea is to ensure that the state store checkpoint for a batch
> version strictly depends on the previously checkpointed version. We propose
> achieving this by introducing a unique ID for each state store checkpoint
> and persisting this ID in the commit logs. This feature will be controlled
> by a new SQL configuration, allowing users to switch between the old and
> new modes when running a supported release.
>
> You can find the design document here
> <https://docs.google.com/document/d/1pT5bwW325VndVH09aZd7TykKXbBP5uCrNIQOC0uDoLc/edit>
> and the related JIRA ticket SPARK-49374
> <https://issues.apache.org/jira/browse/SPARK-49374>.
>
> As we believe this improvement isn't significant enough to require going
> through the SPIP process, we don't plan to pursue that route. However, if
> you have any concerns, please let us know. We would also greatly appreciate
> any comments or feedback on the design.
>
> Best regards,
>
>
> Siying
>

Reply via email to