mbrobbel commented on code in PR #7558: URL: https://github.com/apache/arrow-rs/pull/7558#discussion_r2113252581
########## parquet/src/file/properties.rs: ########## @@ -614,6 +622,84 @@ impl WriterPropertiesBuilder { self } + /// Sets the max length of min/max value fields when writing the column + /// [`Index`] (defaults to `Some(64)` via [`DEFAULT_COLUMN_INDEX_TRUNCATE_LENGTH`]). + /// + /// This can be used to prevent columns with very long values (hundreds of + /// bytes long) from causing the parquet metadata to become huge. + /// + /// # Notes + /// + /// The column [`Index`] is written when [`Self::set_statistics_enabled`] is + /// set to [`EnabledStatistics::Page`]. + /// + /// * If `Some`, must be greater than 0, otherwise will panic Review Comment: Maybe at some point we could change this to use `NonZeroUsize`? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org