wgtmac commented on issue #49715:
URL: https://github.com/apache/arrow/issues/49715#issuecomment-4686753906

   > we could also have a max relative dictionary cardinality setting
   
   +1. This should be a general improvement that applies to most common 
distributions.
   
   > we would switch to PLAIN or BYTE_STREAM_SPLIT
   
   This is a larger change if we take ALP into account. Adaptive encoding is a 
good idea and we can take a simple approach to try every possible encoding per 
column at the 1st page boundary and fix that encoding till the end of this 
column chunk.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to