XixiHua created HUDI-4396:
-----------------------------
Summary: Add a boolean parameter to decide whether column changes
cascade to partitions when Hive table columns change
Key: HUDI-4396
URL: https://issues.apache.org/jira/browse/HUDI-4396
Project: Apache Hudi
Issue Type: Improvement
Reporter: XixiHua
Currently, when the HudiHiveSync and DataHubSyncTool tools apply column changes
to a Hive table, the change is made in HMSDDLExecutor.updateTableDefinition().
Whether it cascades to partitions is decided solely by META_SYNC_PARTITION_FIELDS:
{code:java}
boolean cascade = syncConfig.getSplitStrings(META_SYNC_PARTITION_FIELDS).size() > 0;
{code}
However, some scenarios do not need the column change applied to existing
partitions. Cascading anyway takes a long time, and Hive may even hang when
the number of partitions is large.
Therefore, I propose adding a supplementary boolean config parameter:
*PARTITION_CASCADE_WITH_COLUMN_CHANGE*
The default is true. Users who do not want column changes cascaded to
partitions can set it to false.
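
As a rough illustration of the proposal, the cascade decision could combine the existing partition-field check with the new flag. This is only a sketch: the class CascadeDecision and the method shouldCascade are hypothetical names and not part of Hudi, and combining the two conditions with a logical AND is an assumption about how PARTITION_CASCADE_WITH_COLUMN_CHANGE would be applied.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical helper for illustration only; not part of Hudi.
final class CascadeDecision {

    // partitionFields stands in for the result of
    // syncConfig.getSplitStrings(META_SYNC_PARTITION_FIELDS).
    // cascadeWithColumnChange stands in for the proposed
    // PARTITION_CASCADE_WITH_COLUMN_CHANGE value (default true).
    static boolean shouldCascade(List<String> partitionFields,
                                 boolean cascadeWithColumnChange) {
        // Assumption: cascade only when the table is partitioned
        // AND the user has not disabled cascading.
        return !partitionFields.isEmpty() && cascadeWithColumnChange;
    }

    public static void main(String[] args) {
        List<String> partitioned = Arrays.asList("year", "month");
        List<String> unpartitioned = Collections.emptyList();

        // Flag left at its default: same behavior as today.
        System.out.println(shouldCascade(partitioned, true));
        // Flag set to false: skip the partition cascade.
        System.out.println(shouldCascade(partitioned, false));
        // Unpartitioned table: never cascades, regardless of the flag.
        System.out.println(shouldCascade(unpartitioned, true));
    }
}
```

With the flag left at true, the result matches the current behavior, so existing users would see no change unless they opt out.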