[I] [SUPPORT] Does AWS Redshift Spectrum fully support Hudi meta table features like column stats and bloom filters? [hudi]

via GitHub Sun, 19 Jan 2025 18:06:09 -0800


Ssv-21 opened a new issue, #12674:
URL: https://github.com/apache/hudi/issues/12674


   I’m trying to understand how well AWS Redshift Spectrum supports data 
skipping when working with Hudi datasets.
   
   From the [Athena 
documentation](https://docs.aws.amazon.com/athena/latest/ug/querying-hudi-metadata-table.html),
 it’s clear that Athena supports file listing from Hudi’s meta table but 
doesn’t utilize features like column statistics or bloom filters for data 
skipping. However, the official docs do not mention this for Redshift Spectrum.
   
   Does anyone know whether Redshift Spectrum uses column statistics and bloom 
filters from Hudi’s meta table for query optimization, 
   or whether it works the same way as Athena and supports only file listing?
   
   Any clarification or references would be really helpful. Thanks!


