alamb opened a new issue, #14172: URL: https://github.com/apache/datafusion/issues/14172
### Is your feature request related to a problem or challenge? The most recent version of arrow deprecates the `max_statistics_size` parquet option - https://github.com/apache/datafusion/pull/14153/ You can see the rationale from @etseidl here (it was being silently ignored) - https://github.com/apache/arrow-rs/pull/6884 DataFusion also exposes this setting: https://datafusion.apache.org/user-guide/configs.html datafusion.execution.parquet.max_statistics_size | 4096 | (writing) Sets max statistics size for any column. If NULL, uses default parquet writer setting -- | -- | -- We should deprecate it as well prior to removal ### Describe the solution you'd like 1. Mark the option deprecated in comments 2. Make the field deprecated in the code The config option is defined in two places: https://github.com/apache/datafusion/blob/04b6d4d6099f537de91e1b30a391bdfbc3ec36d5/datafusion/common/src/config.rs#L1723-L1722 https://github.com/apache/datafusion/blob/04b6d4d6099f537de91e1b30a391bdfbc3ec36d5/datafusion/common/src/config.rs#L1723-L1722 But I think they are done via macro ### Describe alternatives you've considered We can just wait until the field is removed upstream in arrow and yank it from datafusion too ### Additional context _No response_ -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
