[GitHub] [arrow-datafusion] JanKaul commented on issue #6339: Simplified TableProvider::Insert API

via GitHub Fri, 12 May 2023 06:46:32 -0700


JanKaul commented on issue #6339:
URL: 
https://github.com/apache/arrow-datafusion/issues/6339#issuecomment-1545777125


   This sounds like a great approach. Thank you for bringing it up.
   
   I'm trying to implement the insert operation for Apache Iceberg Tables and I 
have one point which I would like to make:
   
   Apache Iceberg supports transactions in which datafiles(parquetfiles) can be 
added to the table. For that the `DataSink` trait provides a great way to write 
the files to an external storage. However, after the datafiles are written, the 
metadata of the iceberg table has to be updated. For that it would be necessary 
to have some kind of `flush` method in the `DataSink` trait that allows to 
update the state of the table.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow-datafusion] JanKaul commented on issue #6339: Simplified TableProvider::Insert API

Reply via email to