Daniel Kaźmirski created HUDI-4276:
--------------------------------------
Summary: Reconcile schema - inject null values for missing fields
and add new fields
Key: HUDI-4276
URL: https://issues.apache.org/jira/browse/HUDI-4276
Project: Apache Hudi
Issue Type: Improvement
Reporter: Daniel Kaźmirski
Improve schema reconciliation to make it more flexible in presence of full
schema evolution enabled.
Desired behavior:
# incoming data has missing columns that were already defined in the table –>
null values will be injected into missing columns
# incoming data contains new columns not defined yet in the table -> columns
will be added to the table schema (incoming dataframe?)
# incoming data has missing columns in the table and new columns in the table
-> new columns will be added to the table schema, missing columns will be
injected with null values
No column should be dropped when using hive sync utility.
Related GH issue:
[https://github.com/apache/hudi/issues/5873]
--
This message was sent by Atlassian Jira
(v8.20.7#820007)