sunchao commented on pull request #1284: URL: https://github.com/apache/arrow-rs/pull/1284#issuecomment-1040554342
@alamb @tustvold do you see many production use cases of DeltaBinaryPacked encoding? my understanding is most people are still using Parquet V1 format and hence PLAIN + DICTIONARY + RLE encodings. We also recently implemented support for DeltaBinaryPacked encoding in Spark and the read performance is slower than PLAIN even for sorted data, see [here](https://github.com/apache/spark/pull/34471#issuecomment-972526301). From the benchmark result above, it also appears to be slower than PLAIN (although on par with DICTIONARY). -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
