ayushbansal07 opened a new pull request, #47758:
URL: https://github.com/apache/arrow/pull/47758

   ### Rationale for this change
   Support in pyarrow to read parquet files with page header statistics which 
contain values larger than 8MiB.
   
   ### What changes are included in this PR?
    - Add max_page_header_size in ReaderOptions ReaderProperties.
    - Remove max_page_header_size_ from SerializedPageReader and started 
reading it from properties_. Had to make properties_ inside 
SerializedPageReader non-const for set_max_page_header_size() api.
   
   ### Are these changes tested?
   Yes, tried with a sample python code.
   
   ### Are there any user-facing changes?
   Yes
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to