Re: [PR] [DO NOT MERGE] PARQUET-2414: Add test file for additional BYTE_STREAM_SPLIT types [parquet-testing]

via GitHub Tue, 20 Feb 2024 07:10:29 -0800


wgtmac commented on code in PR #46:
URL: https://github.com/apache/parquet-testing/pull/46#discussion_r1495987249



##########
data/README.md:
##########
@@ -351,3 +353,37 @@ pq.write_table(
 
 This is a practical case where `BYTE_STREAM_SPLIT` encoding obtains a smaller 
file size than `PLAIN` or dictionary.
 Since the distributions are random normals centered at 0, each byte has 
nontrivial behavior.
+
+# Additional types
+
+`byte_stream_split_extended.gzip.parquet` is generated by pyarrow 16.0.0.
+It contains 7 pairs of columns, each in two variants containing the same
+values: one `PLAIN`-encoded and one `BYTE_STREAM_SPLIT`-encoded:
+```
+Version: 2.6

Review Comment:
   hmm... it seems we need to fix the version so here can correctly displays 
version 2.11



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] [DO NOT MERGE] PARQUET-2414: Add test file for additional BYTE_STREAM_SPLIT types [parquet-testing]

Reply via email to