wgtmac opened a new pull request, #1025:
URL: https://github.com/apache/parquet-mr/pull/1025

   ByteStreamSplitValuesReader depends on page.num_values which includes null 
values to compute the total stream length. Then it throws if it fails to read 
enough bytes from the page buffer. This certainly happens if the page contains 
null values.
   
   ### Jira
   
   [PARQUET-2241](https://issues.apache.org/jira/browse/PARQUET-2241)
   
   ### Tests
   
   Add test `org.apache.parquet.avro.TestByteStreamSplitE2E` to write and read 
floating values with BYTE_STREAM_SPLIT encoding.
   
   ### Commits
   
   `ByteStreamSplitValuesReader` strictly depends on remaining stream length to 
get actual number of encoded values before decoding.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to