[
https://issues.apache.org/jira/browse/HUDI-6350?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated HUDI-6350:
---------------------------------
Labels: pull-request-available (was: )
> AWS Hive sync: allow to enable/disable MDT on athena
> -----------------------------------------------------
>
> Key: HUDI-6350
> URL: https://issues.apache.org/jira/browse/HUDI-6350
> Project: Apache Hudi
> Issue Type: New Feature
> Reporter: nicolas paris
> Priority: Major
> Labels: pull-request-available
>
> athena has a nice (but hidden) feature to leverage the hudi metadata table
> instead of listing files on s3. This in theorry reduce the s3 slow down
> trouble (too much listing), speeds-up query planning.
>
> THis can be easily achieved by adding table property:
> hudi.metadata-listing-enabled'='TRUE"
>
> While on athena v2, this feature really helps, on athena v3 at the time of
> writing this, something is going very wrong and the query can be x100 slower.
> see https://docs.aws.amazon.com/athena/latest/ug/querying-hudi.html
--
This message was sent by Atlassian Jira
(v8.20.10#820010)