Ssv-21 opened a new issue, #12674: URL: https://github.com/apache/hudi/issues/12674
I’m trying to understand how well AWS Redshift Spectrum supports data skipping when working with Hudi datasets. From the [Athena documentation](https://docs.aws.amazon.com/athena/latest/ug/querying-hudi-metadata-table.html), it’s clear that Athena supports file listing from Hudi’s meta table but doesn’t utilize features like column statistics or bloom filters for data skipping. However, the official docs do not mention this for Redshift Spectrum. Does anyone know if Redshift Spectrum, Supports column statistics and bloom filters from Hudi’s meta table for query optimization? or it Works the same as Athena by only supporting file listing? Any clarification or references would be really helpful. Thanks! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
