pitrou commented on code in PR #35455:
URL: https://github.com/apache/arrow/pull/35455#discussion_r1219233736
##########
cpp/src/parquet/properties.h:
##########
@@ -525,7 +525,8 @@ class PARQUET_EXPORT WriterProperties {
/// Enable writing page index in general for all columns. Default disabled.
///
/// Page index contains statistics for data pages and can be used to skip
pages
- /// when scanning data in ordered and unordered columns.
+ /// when scanning data in ordered and unordered columns. Note that it does
not
Review Comment:
Perhaps:
> Writing statistics to the page index disables the old method of writing
statistics to each data page header.
> The page index makes filtering more efficient than the page header, as it
gathers all the statistics for a Parquet file in a single place, avoiding
scattered I/O.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]