Ssv-21 opened a new issue, #12674:
URL: https://github.com/apache/hudi/issues/12674

   I’m trying to understand how well AWS Redshift Spectrum supports data 
skipping when working with Hudi datasets.
   
   From the [Athena 
documentation](https://docs.aws.amazon.com/athena/latest/ug/querying-hudi-metadata-table.html),
 it’s clear that Athena supports file listing from Hudi’s meta table but 
doesn’t utilize features like column statistics or bloom filters for data 
skipping. However, the official docs do not mention this for Redshift Spectrum.
   
   Does anyone know if Redshift Spectrum, Supports column statistics and bloom 
filters from Hudi’s meta table for query optimization? 
   or it Works the same as Athena by only supporting file listing?
   
   Any clarification or references would be really helpful. Thanks!


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to