[
https://issues.apache.org/jira/browse/HUDI-4278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-4278:
---------------------------------
Labels: pull-request-available (was: )
> Add skip archive option when syncing to AWS Glue tables
> -------------------------------------------------------
>
> Key: HUDI-4278
> URL: https://issues.apache.org/jira/browse/HUDI-4278
> Project: Apache Hudi
> Issue Type: Improvement
> Reporter: Wenning Ding
> Priority: Major
> Labels: pull-request-available
>
> The issue is each time when Hudi upserts records, it would sync to the
> catalog and update {{last_commit_time_sync}} for the Glue table. Each time it
> updates this property, Glue by default would create a new table version and
> archive old versions. So the problem is if customers update the Hudi table
> frequently, eventually they would hit the Glue table version limit.
> So here inside Hudi, we pass a parameter {{skipGlueArchive}} to the
> environment context to finally pass it to {{{}AWS Glue metadata service{}}},
> so Glue client has an option to decide whether to skip archive or not.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)