[GitHub] [iceberg] vvellanki commented on pull request #2182: Support for PartitionStatsFile in each snapshot

GitBox Wed, 03 Feb 2021 01:53:35 -0800


vvellanki commented on pull request #2182:
URL: https://github.com/apache/iceberg/pull/2182#issuecomment-772380537



   @rdblue The use case for tracking per-partition stats is mainly for planning 
purposes - estimating the number of rows (for join planning, and join ordering) 
and the number of files to read based on the dataset that needs to be read.
   
   As of now, the submitted change rewrites only the required files. Are 
suggesting the following approach:
   - Commit the transaction by including placeholders for the partition stats 
files
   - Asynchronously write the partition stats files


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [iceberg] vvellanki commented on pull request #2182: Support for PartitionStatsFile in each snapshot

Reply via email to