alamb commented on issue #7490:
URL: https://github.com/apache/arrow-rs/issues/7490#issuecomment-2925585282

   > Makes me wonder if we should rethink `EnabledStatistics`. The Parquet spec 
actually recommends not writing page level statistics if the page indexes are 
written. Perhaps we could add something like `EnabledStatistics::ChunkAndIndex` 
to write chunk level and offset/column indexes but no statistics in the page 
header.
   
   After reviewing #7555 (see 
https://github.com/apache/arrow-rs/pull/7555#pullrequestreview-2884457173) I 
%100 agree -- I didn't even realize there were statistics in the page headers. 
   
   I will file a ticket about this
   
   Also, while testing I found that the column index truncate length  was 64 
bytes by which is probably a good size for the main statisics length too. I'll 
make a PR


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to