wgtmac commented on issue #49715: URL: https://github.com/apache/arrow/issues/49715#issuecomment-4686753906
> we could also have a max relative dictionary cardinality setting +1. This should be a general improvement that applies to most common distributions. > we would switch to PLAIN or BYTE_STREAM_SPLIT This is a larger change if we take ALP into account. Adaptive encoding is a good idea and we can take a simple approach to try every possible encoding per column at the 1st page boundary and fix that encoding till the end of this column chunk. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
