alamb commented on issue #3520: URL: https://github.com/apache/arrow-rs/issues/3520#issuecomment-3254515876
> I'm happy to help with this, especially anything related to supporting REE in parquet

Thank you @albertlockett

One big missing piece I know of is the ability to read data from Parquet as REE arrays, even though the pages are often compressed with the RLE/bit-packing hybrid encoding.

So the idea would be to specify to the parquet reader that we want to read data as an REE array, and then implement the appropriate decoders to do that.

Before doing this, it would likely be very helpful to create some sort of benchmark / example where reading an REE array directly would help a lot (maybe a column with very many repeated string values 🤔)
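
To make the "very many repeated string values" case concrete, here is a minimal, stdlib-only sketch of what a run-end encoded column looks like. It is not the arrow-rs API and not a proposed reader design: `RunEndEncoded`, `run_end_encode`, and `value_at` are hypothetical names used only for illustration, and the `i32` run-end type is an assumption (Arrow's REE layout also allows other run-end widths).

```rust
/// Illustrative run-end encoded column (hypothetical type, not arrow-rs).
/// `run_ends[i]` is the exclusive logical end index of run `i`;
/// `values[i]` is the value repeated throughout that run.
#[derive(Debug, PartialEq)]
struct RunEndEncoded {
    run_ends: Vec<i32>,
    values: Vec<String>,
}

impl RunEndEncoded {
    /// Logical (decoded) length: the last run end, or 0 when empty.
    fn len(&self) -> usize {
        self.run_ends.last().map_or(0, |&end| end as usize)
    }

    /// Look up the logical value at `index` by binary-searching the run ends.
    fn value_at(&self, index: usize) -> Option<&str> {
        if index >= self.len() {
            return None;
        }
        // First run whose end is strictly greater than `index`.
        let run = self.run_ends.partition_point(|&end| (end as usize) <= index);
        Some(self.values[run].as_str())
    }
}

/// Collapse consecutive equal strings into (run end, value) pairs.
fn run_end_encode(column: &[&str]) -> RunEndEncoded {
    let mut run_ends: Vec<i32> = Vec::new();
    let mut values: Vec<String> = Vec::new();
    for (i, v) in column.iter().enumerate() {
        let same_as_previous = values.last().map_or(false, |last| last.as_str() == *v);
        if same_as_previous {
            // Extend the current run to cover this row.
            *run_ends.last_mut().unwrap() = (i + 1) as i32;
        } else {
            // Start a new run ending just after this row.
            values.push(v.to_string());
            run_ends.push((i + 1) as i32);
        }
    }
    RunEndEncoded { run_ends, values }
}
```

For example, encoding `["a", "a", "a", "b", "b", "a"]` stores three runs instead of six strings. A benchmark along the lines suggested above would presumably compare materializing every repeated string against producing only the run ends and distinct run values.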