mwlon opened a new pull request, #5293: URL: https://github.com/apache/arrow-rs/pull/5293
# Which issue does this PR close?

Closes #4102.

This builds on the previous attempt at a PR: #4183

# Rationale for this change

This brings us up to speed with the full set of Parquet encodings, I believe. It will also be important for the likely addition of f16 and fixed len byte arrays to the byte stream split encoding.

# What changes are included in this PR?

* implemented byte stream split encoding
* benchmark suite for encodings
* Measured the performance as 3x faster than the previous PR's implementation. There are definitely more potential performance wins to be had with SIMD though.
* tests for the new encoding, including a test file in parquet-testing created via pyarrow.

# Are there any user-facing changes?

No API additions
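
For readers unfamiliar with the encoding this PR refers to, here is a minimal scalar sketch of byte stream split for `f32` values. It is not the code from this PR. The function names `byte_stream_split_encode` and `byte_stream_split_decode` are hypothetical and are not part of the `parquet` crate's API. The sketch assumes little-endian value bytes and no SIMD: byte `k` of every value is written to stream `k`, and the streams are then concatenated.

```rust
/// Width in bytes of one f32 value, i.e. the number of byte streams.
const WIDTH: usize = std::mem::size_of::<f32>();

/// Hypothetical helper: scatter byte k of every value into stream k,
/// then lay the WIDTH streams out back to back in one buffer.
fn byte_stream_split_encode(values: &[f32]) -> Vec<u8> {
    let n = values.len();
    let mut out = vec![0u8; n * WIDTH];
    for (i, v) in values.iter().enumerate() {
        // Little-endian byte order is assumed for the value bytes.
        for (k, b) in v.to_le_bytes().iter().enumerate() {
            out[k * n + i] = *b;
        }
    }
    out
}

/// Hypothetical helper: gather byte k of value i from stream k
/// and reassemble the original f32 values.
fn byte_stream_split_decode(encoded: &[u8]) -> Vec<f32> {
    assert!(
        encoded.len() % WIDTH == 0,
        "encoded length must be a multiple of the value width"
    );
    let n = encoded.len() / WIDTH;
    (0..n)
        .map(|i| {
            let mut bytes = [0u8; WIDTH];
            for k in 0..WIDTH {
                bytes[k] = encoded[k * n + i];
            }
            f32::from_le_bytes(bytes)
        })
        .collect()
}
```

Encoding `[1.0f32, 2.0]` with this sketch groups the two low-order bytes first and the two high-order bytes last. Grouping bytes of similar significance is what tends to make the output friendlier to a general-purpose compressor applied afterwards.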