[ 
https://issues.apache.org/jira/browse/HUDI-8808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Y Ethan Guo updated HUDI-8808:
------------------------------
    Remaining Estimate: 2h
     Original Estimate: 2h

> Fix concurrent execution of appending rollback blocks in the same file group
> ----------------------------------------------------------------------------
>
>                 Key: HUDI-8808
>                 URL: https://issues.apache.org/jira/browse/HUDI-8808
>             Project: Apache Hudi
>          Issue Type: Sub-task
>            Reporter: Y Ethan Guo
>            Priority: Blocker
>              Labels: pull-request-available
>             Fix For: 1.0.1
>
>   Original Estimate: 2h
>  Remaining Estimate: 2h
>
> When multiple log files are generated in the same file group by an
> inflight deltacommit, rollback of that deltacommit can fail, because (1)
> the rollback plan contains multiple rollback requests targeting the log
> files to roll back in the same file group, and (2) concurrent execution of
> these rollback requests in Spark executors creates new rollback log files
> in parallel. Each executor determines the new log version concurrently, so
> multiple executors can pick the same new log version, which causes marker
> creation to fail (i.e., multiple executors try to create a marker with the
> same file name, and every request after the first fails because the
> marker already exists).
> Note that this applies to the Hudi 0.x format only (backwards compatible writer).
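
The collision described above can be illustrated with a small, single-threaded model. This is only a sketch under stated assumptions: the class, the method names, and the marker naming scheme are hypothetical and are not Hudi APIs, and the serialized variant shows one possible mitigation, not necessarily the change made for this issue.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch; none of these names are Hudi APIs.
class RollbackLogVersionSketch {

    // Hypothetical marker name for a rollback log file at a given version.
    static String markerName(String fileId, int version) {
        return fileId + ".log." + version + ".marker";
    }

    // Models the failure: every rollback request derives its new log version
    // from the same snapshot of the file group (latest version + 1), as
    // happens when the requests run in parallel on different executors.
    // Returns how many marker creations fail because the marker exists.
    static int markerFailuresWithSharedSnapshot(String fileId, int latestVersion, int numRequests) {
        Set<String> markers = new HashSet<>();
        int failures = 0;
        for (int i = 0; i < numRequests; i++) {
            int newVersion = latestVersion + 1; // identical for every request
            if (!markers.add(markerName(fileId, newVersion))) {
                failures++; // a marker with this name was already created
            }
        }
        return failures;
    }

    // Models one possible mitigation: requests for the same file group run
    // one after another, so each sees the version written by the previous one.
    static List<Integer> versionsWhenSerialized(int latestVersion, int numRequests) {
        List<Integer> versions = new ArrayList<>();
        int current = latestVersion;
        for (int i = 0; i < numRequests; i++) {
            current++; // next request observes the updated latest version
            versions.add(current);
        }
        return versions;
    }
}
```

With three rollback requests against a file group whose latest log version is 2, the shared-snapshot model has every request choose version 3, so all but the first marker creation fail. The serialized model assigns a distinct version to each request.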



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
