etseidl opened a new issue, #8859: URL: https://github.com/apache/arrow-rs/issues/8859
**Is your feature request related to a problem or challenge? Please describe what you are trying to do.** #8797 adds the option to condense the page encoding statistics in the Parquet `ColumnMetaData` from a `Vec<PageEncodingStats>` to a bitmask. This reduces the number of allocations performed in the decoding of the Parquet metadata and thus speeds up metadata parsing. Currently the default behavior is to parse the full vector of encoding stats, but given the limited use of this information we should instead default to the more concise and performant bitmask. **Describe the solution you'd like** Change the default behavior, but leave an option to get the full stats if required. **Describe alternatives you've considered** No change to the defaults. **Additional context** This change should only be made in a major release as it is a significant behavior change. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
