felipecrv commented on issue #40050: URL: https://github.com/apache/arrow/issues/40050#issuecomment-1946122536
> The error output is confusing, Is this a 'parquet' file? doesn't feel right if the error is known related to a string size limit parameter. So informing the user of this parameter inside the error message would definitely be an improvement. It's actually more likely that user pointed the reader to a random file that starts with bytes encoding a huge length value for the metadata string. Note that after asking if it's a Parquet file, it says `Couldn't deserialize thrift: TProtocolException: Exceeded size limit` -- perhaps that message could be improved with a hint on how to increase the Thrift size limit. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
