stevenzwu opened a new pull request #2305: URL: https://github.com/apache/iceberg/pull/2305
This is the first feature PR for FLIP-27 source. Currently, there are at least two open questions to be addressed. Since I will be out for the rest of the week, I like to put this out first. 1) @openinx suggested that we break the `DataIterator` into two levels (combined and file tasks). I have a question that maybe @openinx can confirm [in the comment from the uber PR](https://github.com/apache/iceberg/pull/2105#discussion_r568303231). 2) Reader is currently implemented on top of `FileSourceSplit` and `BulkFormat`. The original reason is that Jingsong mentioned that we may be able to take advantage of the high-performant vectorized readers from Flink. But I am revisiting that decision. It is unlikely Flink's vectorized readers will support deletes. It seems that Iceberg is also adding vectorized readers and I assume Iceberg implementations will support deletes. @openinx @sundargates @tweise ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
