jonded94 commented on issue #7489:
URL: https://github.com/apache/arrow-rs/issues/7489#issuecomment-2936239923

   @alamb a slightly offtopic question: Comparing `EnabledStatistics` `CHUNK` 
and `PAGE` (with `statistics_truncate_length` == `None`) it seems that in 
`PAGE` mode compared to `CHUNK` mode, the file is 3 times the size?
   
   `101450724 / 34341803 = 2.954...`
   
   As this is a 2 row parquet file, we surely have only a single data page and 
a single row group. 
   
   I would have expected that it's writing statistics once per row group (from 
`CHUNK` mode), and once per data page (from `PAGE` mode). Thereby, I would have 
expected double the file size, not tripple?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to