wgtmac opened a new pull request, #1025: URL: https://github.com/apache/parquet-mr/pull/1025
ByteStreamSplitValuesReader depends on page.num_values which includes null values to compute the total stream length. Then it throws if it fails to read enough bytes from the page buffer. This certainly happens if the page contains null values. ### Jira [PARQUET-2241](https://issues.apache.org/jira/browse/PARQUET-2241) ### Tests Add test `org.apache.parquet.avro.TestByteStreamSplitE2E` to write and read floating values with BYTE_STREAM_SPLIT encoding. ### Commits `ByteStreamSplitValuesReader` strictly depends on remaining stream length to get actual number of encoded values before decoding. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
