ayushbansal07 opened a new pull request, #47758:
URL: https://github.com/apache/arrow/pull/47758
### Rationale for this change
Support in pyarrow to read parquet files with page header statistics which
contain values larger than 8MiB.
### What changes are included in this PR?
- Add max_page_header_size in ReaderOptions ReaderProperties.
- Remove max_page_header_size_ from SerializedPageReader and started
reading it from properties_. Had to make properties_ inside
SerializedPageReader non-const for set_max_page_header_size() api.
### Are these changes tested?
Yes, tried with a sample python code.
### Are there any user-facing changes?
Yes
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]