GitHub user viirya opened a pull request: https://github.com/apache/spark/pull/22514
[SPARK-25271][SQL] Hive ctas commands should use data source if it is convertible

## What changes were proposed in this pull request?

Since 2.3.1 we have had a [regression](https://github.com/apache/spark/pull/20521/files#r217254430): the Hive CTAS command uses only Hive SerDe to write data. Previously, the Hive CTAS command used the Parquet/ORC data source to write data when the table was convertible. As a result, the regression reported in this JIRA is that writing an empty map into Hive using CTAS hits a known Hive issue and throws an exception.

## How was this patch tested?

Added test.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/viirya/spark-1 SPARK-25271-2

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22514.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22514

----
commit 5debc6096ae6e505d3386fd7eb569d154f158d55
Author: Liang-Chi Hsieh <viirya@...>
Date:   2018-09-12T10:33:53Z

    Hive ctas commands should use data source format if it is convertible.

----
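
For readers who want to see the scenario concretely, the following is a minimal sketch of the empty-map CTAS case described in this pull request. It is not the test added by the patch. The object name, table name, and view name are hypothetical, and it assumes a local Spark build with the `spark-hive` module on the classpath. On affected versions the `CREATE TABLE ... AS SELECT` statement is expected to fail inside the Hive SerDe writer. With the proposed change, the write should go through the Parquet data source and the rows should read back.

```scala
import org.apache.spark.sql.{Row, SparkSession}

// Hypothetical helper object; all names here are illustrative only.
object CtasEmptyMapSketch {

  // Writes one non-empty and one empty map through a Hive CTAS
  // into a Parquet-backed Hive table, then reads the rows back.
  def roundTrip(spark: SparkSession, table: String): Array[Row] = {
    import spark.implicits._

    Seq(Map(1 -> "a"), Map.empty[Int, String])
      .toDF("m")
      .createOrReplaceTempView("p")

    spark.sql(s"DROP TABLE IF EXISTS $table")
    // Hive CTAS: this is the statement affected by the regression.
    spark.sql(s"CREATE TABLE $table STORED AS PARQUET AS SELECT m FROM p")
    spark.sql(s"SELECT m FROM $table").collect()
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("ctas-empty-map-sketch")
      .master("local[1]")
      .enableHiveSupport() // requires spark-hive on the classpath
      .getOrCreate()

    try {
      roundTrip(spark, "ctas_empty_map_sketch").foreach(println)
    } finally {
      spark.sql("DROP TABLE IF EXISTS ctas_empty_map_sketch")
      spark.stop()
    }
  }
}
```

Keeping the CTAS in a small function that takes the session as a parameter makes it straightforward to run the same statement against different Spark versions and compare behavior.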