gengliangwang opened a new pull request #23714: [SPARK-26744][SQL]Support schema validation in FileDataSourceV2 framework URL: https://github.com/apache/spark/pull/23714 ## What changes were proposed in this pull request? The API `supportDataType` in `FileFormat` helps to validate the output/input schema before execution starts. So that we can avoid some invalid data source IO, and users can see clean error messages. This PR is to implement the same API in the `FileDataSourceV2` framework. Comparing to `FileFormat`, `FileDataSourceV2` has multiple layers. The API is added in two places: 1. `FileWriteBuilder`: this is where we can get the actual write schema 2. `FileScan`: this is where we can get the actual read schema. ## How was this patch tested? Unit test
---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] With regards, Apache Git Services --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
