JanKaul commented on issue #6339: URL: https://github.com/apache/arrow-datafusion/issues/6339#issuecomment-1545777125
This sounds like a great approach. Thank you for bringing it up. I'm trying to implement the insert operation for Apache Iceberg Tables and I have one point which I would like to make: Apache Iceberg supports transactions in which datafiles(parquetfiles) can be added to the table. For that the `DataSink` trait provides a great way to write the files to an external storage. However, after the datafiles are written, the metadata of the iceberg table has to be updated. For that it would be necessary to have some kind of `flush` method in the `DataSink` trait that allows to update the state of the table. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
