pitrou commented on code in PR #47090: URL: https://github.com/apache/arrow/pull/47090#discussion_r2285483500
##########
cpp/src/parquet/properties.h:
##########
@@ -155,6 +155,7 @@ class PARQUET_EXPORT ReaderProperties {
 ReaderProperties PARQUET_EXPORT default_reader_properties();

 static constexpr int64_t kDefaultDataPageSize = 1024 * 1024;
+static constexpr int64_t kDefaultMaxRowsPerPage = 20'000;

Review Comment:
   The goal is precisely to make the average data page size much smaller than 1MB, which is considered too large as a compression/encoding unit. 1MB is an additional limit in case individual values are large.
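To illustrate how the two limits might interact, here is a minimal sketch in which a page is closed as soon as either the row limit or the byte limit is reached. `PageLimits` and `SplitIntoPages` are hypothetical names made up for this illustration and are not part of the Parquet C++ API. Fixed-size values and raw byte counting are simplifying assumptions; the actual writer may measure encoded size and check the limits at a different granularity.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Constants as they appear in the diff above.
constexpr int64_t kDefaultDataPageSize = 1024 * 1024;
constexpr int64_t kDefaultMaxRowsPerPage = 20'000;

// Hypothetical container for the two limits (illustration only).
struct PageLimits {
  int64_t max_bytes = kDefaultDataPageSize;
  int64_t max_rows = kDefaultMaxRowsPerPage;
};

// Hypothetical helper: appends `num_rows` values of `value_bytes` each and
// closes the current page as soon as either limit is reached.
// Returns the number of rows that ended up in each page.
std::vector<int64_t> SplitIntoPages(int64_t num_rows, int64_t value_bytes,
                                    PageLimits limits = PageLimits{}) {
  assert(value_bytes > 0 && limits.max_bytes > 0 && limits.max_rows > 0);
  std::vector<int64_t> pages;
  int64_t rows_in_page = 0;
  int64_t bytes_in_page = 0;
  for (int64_t i = 0; i < num_rows; ++i) {
    ++rows_in_page;
    bytes_in_page += value_bytes;
    if (rows_in_page >= limits.max_rows || bytes_in_page >= limits.max_bytes) {
      pages.push_back(rows_in_page);
      rows_in_page = 0;
      bytes_in_page = 0;
    }
  }
  // Flush the trailing, partially filled page if any rows remain.
  if (rows_in_page > 0) pages.push_back(rows_in_page);
  return pages;
}
```

Under these assumptions, 8-byte values hit the row limit first (20,000 rows is roughly 160 KB, well under 1MB), while 4 KiB values hit the byte limit after 256 rows, which is the "individual values are large" case the comment mentions.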