gengliangwang opened a new pull request #23714: [SPARK-26744][SQL]Support 
schema validation in FileDataSourceV2 framework
URL: https://github.com/apache/spark/pull/23714
 
 
   ## What changes were proposed in this pull request?
   
   The `supportDataType` API in `FileFormat` validates the input/output schema 
before execution starts, so that we can avoid some invalid data source IO and 
users see clean error messages.
   
   This PR implements the same API in the `FileDataSourceV2` framework. 
Compared to `FileFormat`, `FileDataSourceV2` has multiple layers, so the API is 
added in two places:
   1. `FileWriteBuilder`: this is where we can get the actual write schema.
   2. `FileScan`: this is where we can get the actual read schema.
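
   As a rough illustration of the idea (not the code in this PR), the check at both layers can be sketched as below. Every type here is a hypothetical, simplified stand-in: `DataType`, `Field`, `SchemaValidation`, `CsvLikeWriteBuilder` and `CsvLikeScan` are invented for the example, and the one-argument `supportDataType` signature is an assumption rather than Spark's actual API.

```scala
// Hypothetical, simplified stand-ins for Spark's types; not the real API.
sealed trait DataType
case object IntType extends DataType
case object StringType extends DataType
case object IntervalType extends DataType
final case class Field(name: String, dataType: DataType)

// Shared hook: each format declares which data types it can handle.
trait SchemaValidation {
  def formatName: String

  // Assumed signature, modeled loosely on the API named in the description.
  def supportDataType(dataType: DataType): Boolean

  // Fail before any IO starts, naming the first unsupported column.
  def validateSchema(schema: Seq[Field]): Unit =
    schema.find(f => !supportDataType(f.dataType)).foreach { f =>
      throw new IllegalArgumentException(
        s"$formatName does not support ${f.dataType} (column '${f.name}')")
    }
}

// Write side: the builder knows the actual write schema.
final class CsvLikeWriteBuilder(writeSchema: Seq[Field]) extends SchemaValidation {
  val formatName = "CSV-like source"
  def supportDataType(dataType: DataType): Boolean = dataType match {
    case IntType | StringType => true
    case _                    => false
  }
  def build(): String = {
    validateSchema(writeSchema) // reject the schema before producing a plan
    "write plan"
  }
}

// Read side: the scan knows the actual read schema.
final class CsvLikeScan(readSchema: Seq[Field]) extends SchemaValidation {
  val formatName = "CSV-like source"
  def supportDataType(dataType: DataType): Boolean = dataType match {
    case IntType | StringType => true
    case _                    => false
  }
  def build(): String = {
    validateSchema(readSchema) // reject the schema before producing a plan
    "read plan"
  }
}
```

   In this sketch, building a write or scan over a schema that contains an `IntervalType` column throws an `IllegalArgumentException` up front, while a schema of only `IntType` and `StringType` columns passes through.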
   
   ## How was this patch tested?
   
   Unit tests.
