hudi-bot opened a new issue, #16235:
URL: https://github.com/apache/hudi/issues/16235

   When using Athena v3 (Trino) to query a Hudi table with 10k partitions, we 
noticed very slow query performance compared to running the same query with 
the metadata feature disabled. 
   
   
   The bottleneck appears to be that multiple Trino Hive threads attempting to 
read HFile data get stuck in waiting states because each has to obtain a 
lock. 
   
   
   We noticed that disabling `CACHE_DATA_ON_READ` for the HFileReader in 
`HoodieHFileReaderFactory` can improve query performance with metadata 
enabled, but more investigation is needed to determine whether this has any 
side effects. 
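   As a rough illustration of the workaround described above, the sketch below builds a reader configuration with data-block caching on read turned off. It is not the actual Hudi change: the class `HFileCacheSettings` and its helper method are hypothetical, plain `java.util.Properties` stands in for the Hadoop `Configuration` that the reader factory would really receive, and the key string is assumed to match HBase's `CacheConfig.CACHE_DATA_ON_READ_KEY`.

```java
import java.util.Properties;

// Hypothetical helper, not part of Hudi or HBase.
class HFileCacheSettings {
    // Assumption: this is the key behind HBase's CacheConfig.CACHE_DATA_ON_READ_KEY.
    static final String CACHE_DATA_ON_READ_KEY = "hbase.block.data.cacheonread";

    // Returns a copy of the given settings with cache-on-read disabled,
    // leaving the caller's settings untouched.
    static Properties withDataCacheOnReadDisabled(Properties base) {
        Properties copy = new Properties();
        copy.putAll(base);
        copy.setProperty(CACHE_DATA_ON_READ_KEY, "false");
        return copy;
    }

    public static void main(String[] args) {
        Properties base = new Properties();
        base.setProperty("hoodie.metadata.enable", "true"); // metadata table stays enabled
        Properties tuned = withDataCacheOnReadDisabled(base);
        System.out.println(CACHE_DATA_ON_READ_KEY + "=" + tuned.getProperty(CACHE_DATA_ON_READ_KEY));
    }
}
```

   Copying the settings rather than mutating them keeps the change scoped to the HFile reader, so other readers sharing the base configuration are unaffected while side effects are still being investigated.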
   
   ## JIRA info
   
   - Link: https://issues.apache.org/jira/browse/HUDI-6876
   - Type: Bug
   - Fix version(s):
     - 1.1.0

