[GitHub] [iceberg] RussellSpitzer opened a new issue #1281: Iceberg Datasource Writer Should Automatically Prune Identity Transform Partition Columns

GitBox Fri, 31 Jul 2020 07:55:14 -0700


RussellSpitzer opened a new issue #1281:
URL: https://github.com/apache/iceberg/issues/1281



   While writing the vectorized reader for Identity transforms in parquet I ran 
into the detail that when you write to a Parquet backed Iceberg table it will 
write columns for all data present even if those rows are already defined by 
the partitioning. I think it probably makes sense to strip out these columns on 
write to save space and time when writing. Any thoughts?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [iceberg] RussellSpitzer opened a new issue #1281: Iceberg Datasource Writer Should Automatically Prune Identity Transform Partition Columns

Reply via email to