jonded94 commented on issue #7489: URL: https://github.com/apache/arrow-rs/issues/7489#issuecomment-2936239923
@alamb a slightly off-topic question: comparing `EnabledStatistics` `CHUNK` and `PAGE` (with `statistics_truncate_length` == `None`), it seems that the file written in `PAGE` mode is about 3 times the size of the one written in `CHUNK` mode? `101450724 / 34341803 = 2.954...` As this is a two-row parquet file, we surely have only a single data page and a single row group. I would have expected the statistics to be written once per row group (as in `CHUNK` mode) and, in `PAGE` mode, once more per data page. I would therefore have expected double the file size, not triple?
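For reference, the arithmetic behind the question can be sketched in a few lines of Rust. This only restates the two file sizes quoted above; the function names `size_ratio` and `extra_bytes` are made up for illustration and are not part of the `parquet` crate, and the sketch makes no claim about where the additional bytes come from.

```rust
/// Ratio of two file sizes in bytes (hypothetical helper, not a parquet API).
fn size_ratio(larger: u64, smaller: u64) -> f64 {
    larger as f64 / smaller as f64
}

/// Bytes by which `larger` exceeds `smaller` (hypothetical helper).
fn extra_bytes(larger: u64, smaller: u64) -> u64 {
    larger - smaller
}

fn main() {
    // File sizes quoted in the comment above.
    let chunk_size: u64 = 34_341_803; // EnabledStatistics CHUNK
    let page_size: u64 = 101_450_724; // EnabledStatistics PAGE

    // Observed ratio between the two files.
    println!("ratio: {:.3}", size_ratio(page_size, chunk_size));

    // Bytes that PAGE mode adds on top of the CHUNK-mode file.
    println!("extra bytes: {}", extra_bytes(page_size, chunk_size));

    // Size expected under the "double" assumption, and how far the
    // observed PAGE-mode file exceeds it.
    let expected_double = 2 * chunk_size;
    println!("expected if doubled: {}", expected_double);
    println!("surplus over doubled: {}", extra_bytes(page_size, expected_double));
}
```

Under the "double" assumption the `PAGE` file would be 68683606 bytes; the observed file is 32767118 bytes larger than that, which is the gap the question is about.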