etseidl opened a new issue, #8859:
URL: https://github.com/apache/arrow-rs/issues/8859

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   #8797 adds the option to condense the page encoding statistics in the 
Parquet `ColumnMetaData` from a `Vec<PageEncodingStats>` to a bitmask. This 
reduces the number of allocations performed in the decoding of the Parquet 
metadata and thus speeds up metadata parsing. Currently, the default behavior 
is to parse the full vector of encoding stats, but given the limited use of 
this information, we should instead default to the more concise and performant 
bitmask.
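
   To illustrate the idea, here is a minimal sketch of folding a list of
   per-page encoding stats into a single integer. The types and the functions
   `encodings_to_mask` and `mask_contains` are hypothetical stand-ins written
   for this example, not the API added in #8797. Which page types contribute to
   the real mask is not covered here. This sketch assumes that any encoding
   with a non-zero page count sets its bit.

```rust
/// Simplified stand-in for the Parquet `Encoding` enum (hypothetical subset).
/// The discriminants follow the values in the Parquet Thrift definition.
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum Encoding {
    Plain = 0,
    PlainDictionary = 2,
    Rle = 3,
    RleDictionary = 8,
}

/// Simplified stand-in for the Parquet page type (hypothetical subset).
#[derive(Clone, Copy, Debug, PartialEq)]
pub enum PageType {
    DataPage,
    DictionaryPage,
}

/// Simplified stand-in for one entry of the encoding stats vector.
pub struct PageEncodingStats {
    pub page_type: PageType,
    pub encoding: Encoding,
    pub count: i32,
}

/// Collapse the stats into a bitmask: bit `i` is set if the encoding with
/// discriminant `i` is used by at least one page. This is an assumption of
/// this sketch and may differ from the semantics of the real mask.
pub fn encodings_to_mask(stats: &[PageEncodingStats]) -> u32 {
    stats
        .iter()
        .filter(|s| s.count > 0)
        .fold(0u32, |mask, s| mask | (1u32 << (s.encoding as u32)))
}

/// Check whether a given encoding is present in the mask.
pub fn mask_contains(mask: u32, encoding: Encoding) -> bool {
    mask & (1u32 << (encoding as u32)) != 0
}
```

   The mask is a plain integer, so it needs no heap allocation per column
   chunk, at the cost of dropping the per-encoding page counts and the
   association between page type and encoding.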
   
   **Describe the solution you'd like**
   Change the default behavior, but leave an option to get the full stats if 
required.
   
   **Describe alternatives you've considered**
   No change to the defaults.
   
   **Additional context**
   This change should only be made in a major release as it is a significant 
behavior change.
   

