hudi-bot opened a new issue, #16235:
URL: https://github.com/apache/hudi/issues/16235
When using Athena v3 (Trino) to query a Hudi table with 10k partitions,
query performance was very slow compared to the same query with the metadata
feature disabled.
The bottleneck appears to occur when multiple Trino Hive threads
attempt to read HFile data and get stuck in waiting states because they have to
obtain a lock.
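One way to confirm this kind of contention on a JVM you control (this is not possible on managed Athena itself) is to group blocked threads by the lock they are waiting for. The following is a minimal sketch using only the standard `java.lang.management` API. The class name `BlockedThreadSummary` and the `reader-N` thread names are hypothetical, and the demo in `main` only simulates contention on a plain monitor; it does not touch Hudi or HFile code.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical helper: summarizes which locks have threads blocked on them.
final class BlockedThreadSummary {

    /** Counts threads in the BLOCKED state, grouped by the lock they wait for. */
    static Map<String, Integer> countBlockedByLock(ThreadInfo[] infos) {
        Map<String, Integer> counts = new TreeMap<>();
        for (ThreadInfo info : infos) {
            if (info == null) {
                continue; // a thread may have exited while the dump was taken
            }
            // BLOCKED covers monitor (synchronized) contention only; threads
            // waiting on java.util.concurrent locks show up as WAITING instead.
            if (info.getThreadState() == Thread.State.BLOCKED) {
                String lock = info.getLockName();
                counts.merge(lock == null ? "<unknown>" : lock, 1, Integer::sum);
            }
        }
        return counts;
    }

    /** Takes a thread dump of the current JVM and summarizes it. */
    static Map<String, Integer> snapshot() {
        ThreadMXBean bean = ManagementFactory.getThreadMXBean();
        return countBlockedByLock(bean.dumpAllThreads(false, false));
    }

    public static void main(String[] args) throws InterruptedException {
        final Object sharedLock = new Object();
        synchronized (sharedLock) {
            // Simulated readers that all need the same monitor.
            for (int i = 0; i < 4; i++) {
                Thread t = new Thread(() -> {
                    synchronized (sharedLock) {
                        // would read cached data here
                    }
                }, "reader-" + i);
                t.setDaemon(true);
                t.start();
            }
            Thread.sleep(200); // give the readers time to block
            snapshot().forEach((lock, n) ->
                System.out.println(n + " thread(s) blocked on " + lock));
        }
    }
}
```

If most blocked threads report the same lock, and their stack traces point into the HFile block cache path, that would support the lock-contention explanation above.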
Disabling `CACHE_DATA_ON_READ` for the HFileReader in
`HoodieHFileReaderFactory` can improve query performance with metadata
enabled, but more investigation is needed to determine whether this has any
side effects.
## JIRA info
- Link: https://issues.apache.org/jira/browse/HUDI-6876
- Type: Bug
- Fix version(s): 1.1.0