[GitHub] spark pull request #22514: [SPARK-25271][SQL] Hive ctas commands should use ...

viirya Thu, 20 Sep 2018 23:25:15 -0700

GitHub user viirya opened a pull request:

    https://github.com/apache/spark/pull/22514


    [SPARK-25271][SQL] Hive ctas commands should use data source if it is 
convertible

    ## What changes were proposed in this pull request?
    
    We have a 
[regression](https://github.com/apache/spark/pull/20521/files#r217254430) since 
2.3.1 that Hive ctas command only uses Hive Serde to write data. Hive ctas 
command previously will use Parquet/Orc data source to write data if it is 
convertible.
    
    Because of it, the related regression reported by this JIRA is when writing 
a empty map in to Hive using ctas, it hits Hive's known issue and is thrown 
exception.
    
    ## How was this patch tested?
    
    Added test.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/viirya/spark-1 SPARK-25271-2

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/22514.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #22514
    
----
commit 5debc6096ae6e505d3386fd7eb569d154f158d55
Author: Liang-Chi Hsieh <viirya@...>
Date:   2018-09-12T10:33:53Z

    Hive ctas commands should use data source format if it is convertible.

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #22514: [SPARK-25271][SQL] Hive ctas commands should use ...

Reply via email to