Qijun Fu created HUDI-6959:
------------------------------

             Summary: Do not rollback current instant when bulk insert as row failed
                 Key: HUDI-6959
                 URL: https://issues.apache.org/jira/browse/HUDI-6959
             Project: Apache Hudi
          Issue Type: Bug
          Components: spark
            Reporter: Qijun Fu


When org.apache.hudi.spark3.internal.HoodieDataSourceInternalBatchWrite#abort 
is called, some subtasks may not have been canceled yet. If we roll back the 
current instant immediately, those subtasks can still write new files after the 
rollback has been scheduled, which leaves dirty data behind.
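
To make the ordering problem concrete, here is a minimal toy model of it. It is 
only a sketch: ToyTable, rollbackInsideAbort and rollbackDeferred are 
hypothetical names made up for illustration, not Hudi classes or APIs, and the 
"table" is just a set of file names. It assumes one straggler task writes a 
file after abort has returned.

```java
import java.util.Set;
import java.util.TreeSet;

/** Toy model only: none of these names are Hudi APIs. */
public class AbortRaceSketch {

    /** Files on storage, named "<instant>_task<id>.parquet". */
    static final class ToyTable {
        final Set<String> files = new TreeSet<>();

        void write(String instant, int taskId) {
            files.add(instant + "_task" + taskId + ".parquet");
        }

        /** Deletes every file that belongs to the given instant. */
        void rollback(String instant) {
            files.removeIf(f -> f.startsWith(instant + "_"));
        }
    }

    /** abort() rolls back at once while task 2 is still running. */
    static Set<String> rollbackInsideAbort() {
        ToyTable table = new ToyTable();
        String instant = "001";
        table.write(instant, 1);  // task 1 wrote a file before the failure
        table.rollback(instant);  // abort(): immediate rollback
        table.write(instant, 2);  // straggler task 2 was not canceled yet
        return table.files;       // the file from task 2 is left behind
    }

    /** abort() does not roll back; cleanup runs after all writers stopped. */
    static Set<String> rollbackDeferred() {
        ToyTable table = new ToyTable();
        String instant = "001";
        table.write(instant, 1);  // task 1 wrote a file before the failure
        // abort(): the instant is only left in a failed state here
        table.write(instant, 2);  // straggler task 2 still writes
        table.rollback(instant);  // later rollback sees both files
        return table.files;
    }

    public static void main(String[] args) {
        System.out.println("immediate: " + rollbackInsideAbort());
        System.out.println("deferred:  " + rollbackDeferred());
    }
}
```

In this model the immediate rollback leaves 001_task2.parquet on storage, 
while the deferred rollback leaves nothing, because it runs only after the 
last writer has finished.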

 

We should instead roll back the failed instant using the common rollback 
mechanism (eager or lazy).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
