[ https://issues.apache.org/jira/browse/PARQUET-2226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17677063#comment-17677063 ]
ASF GitHub Bot commented on PARQUET-2226: ----------------------------------------- yabola commented on code in PR #1020: URL: https://github.com/apache/parquet-mr/pull/1020#discussion_r1070638035 ########## parquet-column/src/test/java/org/apache/parquet/column/values/bloomfilter/TestBlockSplitBloomFilter.java: ########## @@ -181,6 +182,83 @@ public void testBloomFilterNDVs(){ assertTrue(bytes < 5 * 1024 * 1024); } + @Test + public void testMergeEmptyBloomFilter() throws IOException { Review Comment: I added a test for two BFs are not compatible. > Support merge Bloom Filter > -------------------------- > > Key: PARQUET-2226 > URL: https://issues.apache.org/jira/browse/PARQUET-2226 > Project: Parquet > Issue Type: Improvement > Reporter: Mars > Priority: Major > > We need to collect Parquet's bloom filter of multiple files, and then > synthesize a more comprehensive bloom filter for common use. > Guava supports similar api operations > https://guava.dev/releases/31.0.1-jre/api/docs/src-html/com/google/common/hash/BloomFilter.html#line.252 -- This message was sent by Atlassian Jira (v8.20.10#820010)