[
https://issues.apache.org/jira/browse/HUDI-2961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rajesh Mahindra updated HUDI-2961:
----------------------------------
Priority: Major (was: Blocker)
> Async table services can race with metadata table updates
> ---------------------------------------------------------
>
> Key: HUDI-2961
> URL: https://issues.apache.org/jira/browse/HUDI-2961
> Project: Apache Hudi
> Issue Type: Task
> Components: writer-core
> Reporter: Manoj Govindassamy
> Assignee: Ethan Guo
> Priority: Major
> Fix For: 0.11.0
>
>
> Today Metadata table updates are done inline/synchronous with the data table
> updates. Metadata data table updates can also sometime trigger table services
> like compaction which are also done inline w.r.t the ongoing commit. So,
> updates in the metadata table are always serial. However, there can be async
> table services like clustering which are running in parallel with single or
> multiple writers and can update the metadata table in parallel with the
> writer commits.
> In the multi writer case, since we anyway have the lock provider configured
> metadata table updates are guarded for race. But, the lock providers are not
> must today for single writer + async table service deployments, leading to
> race in metadata table updates. Async table service like clustering can race
> with the metadata table compaction, and can update the wrong delta log file
> than the right next delta file from the compaction.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)