wgtmac commented on code in PR #46: URL: https://github.com/apache/parquet-testing/pull/46#discussion_r1495987249
########## data/README.md: ########## @@ -351,3 +353,37 @@ pq.write_table( This is a practical case where `BYTE_STREAM_SPLIT` encoding obtains a smaller file size than `PLAIN` or dictionary. Since the distributions are random normals centered at 0, each byte has nontrivial behavior. + +# Additional types + +`byte_stream_split_extended.gzip.parquet` is generated by pyarrow 16.0.0. +It contains 7 pairs of columns, each in two variants containing the same +values: one `PLAIN`-encoded and one `BYTE_STREAM_SPLIT`-encoded: +``` +Version: 2.6 Review Comment: hmm... it seems we need to fix the version so here can correctly displays version 2.11 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
