[GitHub] [arrow-rs] sunchao commented on pull request #1284: Vectorized DeltaBitPackDecoder (#1281)

GitBox Tue, 15 Feb 2022 09:18:54 -0800


sunchao commented on pull request #1284:
URL: https://github.com/apache/arrow-rs/pull/1284#issuecomment-1040554342



   @alamb @tustvold do you see many production use cases for the DeltaBinaryPacked 
encoding? My understanding is that most people still use the Parquet V1 format, and 
hence the PLAIN + DICTIONARY + RLE encodings. We also recently implemented support 
for DeltaBinaryPacked in Spark, and its read performance is slower than 
PLAIN even for sorted data; see 
[here](https://github.com/apache/spark/pull/34471#issuecomment-972526301). From 
the benchmark results above, it also appears to be slower than PLAIN (although 
on par with DICTIONARY).


