xloya opened a new pull request #3724: URL: https://github.com/apache/iceberg/pull/3724
When rewriting, the data file read by `Spark3BinPackStrategy` through `SparkSession.read()` and the pieces of data after applying delete files may be 0. At this time, it will throw a `IllegalArgumentException` which is "Cannot stage null or empty file set" in `FileRewriteCoordinator.stageRewrite()` when rewriting. For these data files, if their split_offsets list do not exist or there is only one element, it can be directly expired, speeding up query efficiency. cc @RussellSpitzer @rdblue @kbendick @jackye1995 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
