rdblue commented on pull request #2582:
URL: https://github.com/apache/iceberg/pull/2582#issuecomment-844578561


   @jshmchenxi, I think this should be done in several PRs instead of one. 
First, we would need to update the Parquet version, then we would want to add 
read support and finally we would add write support. That will help keep the 
changes to a size where reviewers can get through them in a reasonable amount 
of time.
   
   I also think that we need to more carefully consider how to configure 
Parquet's bloom filters. I would expect what you've added here as table 
properties to be column specific. Why did you choose global settings. Does this 
create a bloom filter with the same NDV for all columns?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to