wjones127 opened a new pull request, #34668: URL: https://github.com/apache/arrow/pull/34668
### Rationale for this change

We should be defensive about invalid inputs. Otherwise, malicious actors might send invalid Parquet files as a way to crash a user's service.

### What changes are included in this PR?

Adds a guard against trying to read empty pages. Also adds a DCHECK that validates that `num_valid_values_` was initialized (it wasn't when `len = 0`). Also adds a few more general test cases for the encoding, covering empty arrays and empty strings.

### Are these changes tested?

Yes, several unit tests have been added.

### Are there any user-facing changes?

**This PR contains a "Critical Fix".**
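
For readers unfamiliar with the failure mode, here is a minimal sketch of the kind of guard described under "What changes are included in this PR?". It is not the code from this PR: `ToyPageDecoder`, its one-byte count header, and the exception type are all hypothetical, and a plain `assert` stands in for Arrow's `DCHECK`. Only `num_valid_values_` and `len` come from the description above.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <cstring>
#include <optional>
#include <stdexcept>

// Hypothetical toy decoder, NOT Arrow's actual class.
// Assumed page layout for this sketch: one header byte holding the count of
// valid values, followed by that many one-byte values.
class ToyPageDecoder {
 public:
  void SetData(int num_values, const uint8_t* data, int len) {
    if (num_values < 0 || len < 0) {
      throw std::invalid_argument("negative size");
    }
    if (len == 0) {
      // Guard: an empty page has no header to parse. Record that there is
      // nothing to decode instead of leaving the counter uninitialized.
      data_ = nullptr;
      num_valid_values_ = 0;
      return;
    }
    const int declared = data[0];
    // Never trust the header: it must fit in both the page and the caller's
    // expected value count.
    if (declared > len - 1 || declared > num_values) {
      throw std::invalid_argument("header count exceeds page contents");
    }
    data_ = data + 1;
    num_valid_values_ = declared;
  }

  // Copies up to max_values values into out and returns how many were copied.
  int Decode(uint8_t* out, int max_values) {
    // Stand-in for a DCHECK that SetData() initialized the counter.
    assert(num_valid_values_.has_value());
    const int n = std::max(0, std::min(max_values, *num_valid_values_));
    if (n > 0) {
      std::memcpy(out, data_, static_cast<size_t>(n));
      data_ += n;
    }
    *num_valid_values_ -= n;
    return n;
  }

 private:
  const uint8_t* data_ = nullptr;
  std::optional<int> num_valid_values_;  // empty until SetData() runs
};
```

Using `std::optional` here is a choice made for the sketch so that "never initialized" is distinguishable from "zero values"; the real decoder may track this differently.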
