alamb commented on issue #7490: URL: https://github.com/apache/arrow-rs/issues/7490#issuecomment-2925585282
> Makes me wonder if we should rethink `EnabledStatistics`. The Parquet spec actually recommends not writing page level statistics if the page indexes are written. Perhaps we could add something like `EnabledStatistics::ChunkAndIndex` to write chunk level and offset/column indexes but no statistics in the page header. After reviewing #7555 (see https://github.com/apache/arrow-rs/pull/7555#pullrequestreview-2884457173) I %100 agree -- I didn't even realize there were statistics in the page headers. I will file a ticket about this Also, while testing I found that the column index truncate length was 64 bytes by which is probably a good size for the main statisics length too. I'll make a PR -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org