[jira] [Created] (HUDI-2682) Spark schema not updated with new columns on hive sync

Charlie Briggs (Jira) Wed, 03 Nov 2021 09:48:07 -0700

Charlie Briggs created HUDI-2682:
------------------------------------

             Summary: Spark schema not updated with new columns on hive sync
                 Key: HUDI-2682
                 URL: https://issues.apache.org/jira/browse/HUDI-2682
             Project: Apache Hudi
          Issue Type: Bug
          Components: Hive Integration
    Affects Versions: 0.9.0
            Reporter: Charlie Briggs



When syncing hive schema, new columns added from the source dataset are not 
propagated to the `spark.sql.sources.schema` metadata. This leads to columns 
not being available when querying the dataset via spark SQL.

Tested with both spark data writer and deltastreamer). 

The column we observed this on was a struct column, but it seems like it would 
be independent.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Created] (HUDI-2682) Spark schema not updated with new columns on hive sync

Reply via email to