Rajesh Mahindra created HUDI-9627:
-------------------------------------
Summary: Add bloom filter pruning when looking up keys in metadata
files
Key: HUDI-9627
URL: https://issues.apache.org/jira/browse/HUDI-9627
Project: Apache Hudi
Issue Type: Improvement
Reporter: Rajesh Mahindra
When performing key look up in metadata (hfile) files, we only leverage the
hfile pruning (based on ranges). However, we are already writing bloom filters
for each hfile. This PR adds an optional ability to first filter the keys based
on the bloom filters, which can avoid the download and deser of a subset of the
Hfile blocks and reduce latencies. This should be essentially useful for upsert
workloads with mostly new(insert) records.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)