alamb commented on issue #3520:
URL: https://github.com/apache/arrow-rs/issues/3520#issuecomment-3254515876

   > I'm happy to help with this, especially anything related to supporting REE 
in parquet
   
   Thank you @albertlockett 
   
   One big missing piece I know if is the ability to read data from Parquet as 
REE arrays -- even though often the pages are compressed with the REE/Bit 
packing hybrid
   
   So the idea would be to specify to the parquet reader that we wanted to read 
data as an REE array and then implement the appropriate decoders to do that
   
   Before doing this it would likely be very helpful to create some sort of 
benchmark / example where reading an REE array directly would help a lot (maybe 
a column with very many repeated string values 🤔 )


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to