Sanford Ryza created SPARK-54358:
------------------------------------

             Summary: Checkpoint dirs collide when streaming tables in 
different schemas have same name
                 Key: SPARK-54358
                 URL: https://issues.apache.org/jira/browse/SPARK-54358
             Project: Spark
          Issue Type: Sub-task
          Components: Declarative Pipelines
    Affects Versions: 4.1.0
            Reporter: Sanford Ryza


A streaming table is a table fed by a stream. Streaming tables have checkpoint 
directories underneath their pipeline's storage root. These directories don't 
currently take the table namespace into account, which means that two tables 
with different schemas but the same name will be mapped to the same checkpoint 
directory. This could be very bad and cause data loss.

We should update the per-streaming table checkpoint path to use the fully 
qualified table path, instead of just its name.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to