johanl-db commented on PR #53732:
URL: https://github.com/apache/spark/pull/53732#issuecomment-3743631463

   > @johanl-db : A big question is where we should apply schema evolution. For 
MERGE INTO, @szehon-ho did a lot of work to do schema evolution within Spark, 
so that the behavior is controlled by Spark and all data source will have the 
same behavior. From this PR, it seems the data source need to implement INSERT 
schema evolution by its own?
   
   DSv1 data sources still need to do schema evolution themselves, this doesn't 
change with this PR.
   For DSv2, the goal is to have Spark handle schema evolution, afaik work is 
ongoing for MERGE INTO but and INSERT will eventually be covered.
   
   This is somewhat orthogonal to this PR that focuses on adding dedicated SQL 
syntax, but doesn't intend to extend schema evolution support beyond what 
exists today (i.e. mostly provided by DSv1 data source implementation 
themselves)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to