Noemi Pap-Takacs created IMPALA-13501:
-----------------------------------------

             Summary: Conflicting commits to Iceberg tables leave uncommitted 
orphan files
                 Key: IMPALA-13501
                 URL: https://issues.apache.org/jira/browse/IMPALA-13501
             Project: IMPALA
          Issue Type: Improvement
          Components: Catalog
            Reporter: Noemi Pap-Takacs


Iceberg supports multiple writers with optimistic concurrency. Each writer can 
write new files which are then added to the table after a validation check to 
ensure that the commit does not conflict with other modifications made during 
the execution.

When a conflicting change prevents the newly written files from being 
committed, the commit can first be retried by rebasing it on top of the 
latest snapshot. If the retry cannot resolve the conflict, the change cannot 
be committed and the files become orphan files in the file system.
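
The flow above can be illustrated with a minimal, single-threaded sketch. It 
does not use the Iceberg or Impala APIs: ToyTable, try_commit and 
run_scenario are hypothetical names, and the validation rule (two commits 
conflict when they modify the same partition) is an assumption chosen only to 
keep the example small.

```python
from dataclasses import dataclass, field


@dataclass
class ToyTable:
    """Hypothetical in-memory stand-in for a table; not the Iceberg API."""
    snapshot_id: int = 0
    committed: set = field(default_factory=set)  # files referenced by table metadata
    storage: set = field(default_factory=set)    # files present in the file system
    touched: dict = field(default_factory=dict)  # snapshot id -> partition it modified

    def write_file(self, path):
        # Data files are written before the commit is attempted.
        self.storage.add(path)

    def try_commit(self, start_snapshot, partition, files):
        """Rebase onto the latest snapshot; return False on a real conflict."""
        # Assumed rule: a snapshot added since the writer started conflicts
        # if it modified the same partition.
        for sid in range(start_snapshot + 1, self.snapshot_id + 1):
            if self.touched[sid] == partition:
                return False  # validation failed: nothing is committed
        self.snapshot_id += 1
        self.touched[self.snapshot_id] = partition
        self.committed.update(files)
        return True

    def orphan_files(self):
        # Files that exist in storage but are not referenced by any commit.
        return self.storage - self.committed


def run_scenario():
    table = ToyTable()
    start = table.snapshot_id  # all three writers start from the same snapshot
    for name in ("a.parquet", "b.parquet", "c.parquet"):
        table.write_file(name)
    results = [
        table.try_commit(start, "p1", ["a.parquet"]),  # first commit wins
        table.try_commit(start, "p2", ["b.parquet"]),  # rebased: other partition
        table.try_commit(start, "p1", ["c.parquet"]),  # conflicts with writer A
    ]
    return results, table.orphan_files()
```

In this sketch the third writer's file stays in storage although its commit 
was rejected, which is the orphan-file situation described above.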

It would be nice to remove the files left over from an unsuccessful commit 
right away, as part of handling the failed commit. Deleting orphan files 
later as a table maintenance step is also a possible resolution.
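
The two options could look roughly like the sketch below. It works on a 
plain local directory rather than on a real table: cleanup_failed_commit, 
remove_orphan_files and demo are hypothetical names, and the age threshold 
in the maintenance variant is an assumption added so that files of writers 
that have not committed yet are left alone.

```python
import os
import tempfile
import time
from pathlib import Path


def cleanup_failed_commit(written_files):
    """Option 1: the writer deletes exactly the files it wrote."""
    for path in written_files:
        Path(path).unlink(missing_ok=True)


def remove_orphan_files(data_dir, referenced_files, older_than_seconds):
    """Option 2: maintenance scan for unreferenced files older than a threshold."""
    now = time.time()
    referenced = {Path(p).resolve() for p in referenced_files}
    removed = []
    # Materialize the listing first so deletion does not disturb the scan.
    for path in list(Path(data_dir).rglob("*")):
        if not path.is_file() or path.resolve() in referenced:
            continue
        # Recent files may belong to a writer that has not committed yet.
        if now - path.stat().st_mtime >= older_than_seconds:
            path.unlink()
            removed.append(path.name)
    return sorted(removed)


def demo():
    with tempfile.TemporaryDirectory() as d:
        committed = Path(d, "committed.parquet")
        failed = Path(d, "failed.parquet")
        stale = Path(d, "stale.parquet")
        in_flight = Path(d, "in_flight.parquet")
        for p in (committed, failed, stale, in_flight):
            p.write_text("data")
        # Pretend the stale orphan was written an hour ago.
        an_hour_ago = time.time() - 3600
        os.utime(stale, (an_hour_ago, an_hour_ago))

        cleanup_failed_commit([failed])  # immediate cleanup by the writer
        removed = remove_orphan_files(d, [committed], older_than_seconds=600)
        remaining = sorted(p.name for p in Path(d).iterdir())
        return removed, remaining
```

The first option needs no scan because the writer already knows which files 
it produced. The second also catches files left behind by writers that 
crashed before they could clean up.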



