Xiao Li created SPARK-21769:
-------------------------------

             Summary: Add a table property for Hive-serde tables to controlling 
Spark always respecting schemas inferred by Spark SQL
                 Key: SPARK-21769
                 URL: https://issues.apache.org/jira/browse/SPARK-21769
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 2.3.0
            Reporter: Xiao Li
            Assignee: Xiao Li


For Hive-serde tables, we always respect the schema stored in Hive metastore, 
because the schema could be altered by the other engines that share the same 
metastore. Thus, we always trust the metastore-controlled schema for Hive-serde 
tables when the schemas are different (without considering the nullability and 
cases). However, in some scenarios, Hive metastore also could INCORRECTLY 
overwrite the schemas when the serde and Hive metastore built-in serde are 
different. 

The proposed solution is to introduce a table property for such scenarios. For 
a specific Hive-serde table, users can manually setting such table property for 
asking Spark for always respect Spark-inferred schema instead of trusting 
metastore-controlled schema. By default, it is off. 




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to