GitHub user tdas opened a pull request:

    https://github.com/apache/spark/pull/18402

    [SPARK-21192][SS] Preserve State Store provider class configuration across 
restarts

    ## What changes were proposed in this pull request?
    
    If the SQL conf for StateStore provider class is changed between restarts 
(i.e. query started with providerClass1 and attempted to restart using 
providerClass2), then the query will fail in a unpredictable way as files saved 
by one provider class cannot be used by the newer one.
    
    Ideally, the provider class used to start the query should be used to 
restart the query, and the configuration in the session where it is being 
restarted should be ignored.
    
    This PR saves the provider class config to OffsetSeqLog, in the same way # 
shuffle partitions is saved and recovered.
    
    ## How was this patch tested?
    new unit tests

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/tdas/spark SPARK-21192

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18402.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18402
    
----
commit 0255e5d5246dd97e1f4721648a693a2c2b7cde97
Author: Tathagata Das <[email protected]>
Date:   2017-06-23T10:20:47Z

    Added providerClass to OffsetSeqLog

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to