RussellSpitzer opened a new issue #1281: URL: https://github.com/apache/iceberg/issues/1281
While writing the vectorized reader for Identity transforms in parquet I ran into the detail that when you write to a Parquet backed Iceberg table it will write columns for all data present even if those rows are already defined by the partitioning. I think it probably makes sense to strip out these columns on write to save space and time when writing. Any thoughts? ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
