cravani opened a new pull request, #3359:
URL: https://github.com/apache/parquet-java/pull/3359

   ### Rationale for this change
   When reading Parquet files with large metadata (e.g., files with thousands 
of columns), the default Thrift message size limit can be insufficient, causing 
TTransportException: Message size exceeds limit errors. Currently, the Thrift 
protocol configuration uses default max message size (100MB), preventing users 
from reading files with exceptionally large metadata footers.
   
   ### What changes are included in this PR?
   Add a new configuration key: parquet.thrift.string.size.limit
   Default value: 100 MB (104857600 bytes)
   Allow users to override this via Configuration
   
   ### Are these changes tested?
   Yes
   
   ### Are there any user-facing changes?
   Not by default, user can set config parquet.thrift.string.size.limit=<limit> 
to increase it based on need.
   
   Closes #GH-3358


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to