sivabalan narayanan created HUDI-2703:
-----------------------------------------
Summary: [RFC-37] Metadata based bloom index
Key: HUDI-2703
URL: https://issues.apache.org/jira/browse/HUDI-2703
Project: Apache Hudi
Issue Type: Improvement
Reporter: sivabalan narayanan
Fix For: 0.10.0
Hudi has indices to assist in tagging incoming records. The most commonly used
one is the Bloom index. It works by loading bloom filters from the data files,
which can be time-consuming and can lead to throttling in cloud stores such as
S3. This RFC therefore proposes storing the bloom filters in a special partition
of the metadata table and implementing an index based on that partition.
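
To make the idea concrete, here is a minimal sketch of looking up candidate
files from one centralized collection of bloom filters instead of opening each
data file. It is not Hudi's implementation: the BloomFilter class, the
build_bloom_partition and candidate_files helpers, the plain dict standing in
for the metadata table partition, and the file names are all hypothetical and
chosen only for illustration.

```python
import hashlib


class BloomFilter:
    """Tiny illustrative Bloom filter (an assumption, not Hudi's class)."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, key):
        # Derive num_hashes bit positions by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits |= 1 << pos

    def might_contain(self, key):
        # False means definitely absent; True means possibly present.
        return all((self.bits >> pos) & 1 for pos in self._positions(key))


def build_bloom_partition(files):
    """Hypothetical stand-in for the bloom partition: file name -> filter."""
    partition = {}
    for file_name, record_keys in files.items():
        bloom = BloomFilter()
        for record_key in record_keys:
            bloom.add(record_key)
        partition[file_name] = bloom
    return partition


def candidate_files(partition, record_key):
    """Return the files that may hold record_key, without reading any data file."""
    return sorted(
        name for name, bloom in partition.items() if bloom.might_contain(record_key)
    )


if __name__ == "__main__":
    partition = build_bloom_partition(
        {"file-a.parquet": ["key-1", "key-2"], "file-b.parquet": ["key-3"]}
    )
    print(candidate_files(partition, "key-1"))
```

Because a bloom filter never reports a false negative, the file that actually
holds a key is always among the candidates. Other files may occasionally appear
as false positives and would still need to be checked against the real data.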
--
This message was sent by Atlassian Jira
(v8.3.4#803005)