[ https://issues.apache.org/jira/browse/PARQUET-2373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789930#comment-17789930 ]
ASF GitHub Bot commented on PARQUET-2373:
-----------------------------------------

mapleFU commented on PR #1184:
URL: https://github.com/apache/parquet-mr/pull/1184#issuecomment-1827228087

   FYI, I've updated a BloomFilter with length for testing: https://github.com/apache/parquet-testing/pull/43

> Improve I/O performance with bloom_filter_length
> ------------------------------------------------
>
>                 Key: PARQUET-2373
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2373
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Jiashen Zhang
>            Priority: Minor
>
> The spec change in PARQUET-2257 added bloom_filter_length so that readers can
> load the bloom filter in a single shot. This change uses the
> 'bloom_filter_length' field to load the whole bloom filter (header and
> bitset together) in one read, which improves I/O scheduling.
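
The single-shot read described in the issue can be illustrated with a minimal Python sketch. It is not the parquet-mr implementation: the 4-byte little-endian header, the `CountingFile` wrapper, and `read_bloom_filter` are all hypothetical stand-ins (real Parquet files use a Thrift-encoded bloom filter header). It only shows why knowing the total length up front saves a read.

```python
import io
import struct

# Hypothetical fixed-size header holding the bitset size in bytes.
# Assumption for illustration only: real Parquet uses a Thrift-encoded header.
HEADER_SIZE = 4


class CountingFile:
    """Wraps a seekable binary stream and counts positioned reads."""

    def __init__(self, raw):
        self.raw = raw
        self.reads = 0

    def read_at(self, offset, size):
        self.raw.seek(offset)
        self.reads += 1
        return self.raw.read(size)


def read_bloom_filter(f, offset, length=None):
    """Return the bitset bytes of the bloom filter that starts at `offset`."""
    if length is not None:
        # Length known: header and bitset arrive in a single read.
        buf = f.read_at(offset, length)
        (num_bytes,) = struct.unpack_from("<I", buf, 0)
        return buf[HEADER_SIZE:HEADER_SIZE + num_bytes]
    # Length unknown: read the header to learn the bitset size, then the bitset.
    header = f.read_at(offset, HEADER_SIZE)
    (num_bytes,) = struct.unpack("<I", header)
    return f.read_at(offset + HEADER_SIZE, num_bytes)


# Build a fake file: 10 bytes of padding, then header + bitset.
bitset = bytes(range(32))
blob = b"\x00" * 10 + struct.pack("<I", len(bitset)) + bitset

f1 = CountingFile(io.BytesIO(blob))
with_length = read_bloom_filter(f1, 10, HEADER_SIZE + len(bitset))

f2 = CountingFile(io.BytesIO(blob))
without_length = read_bloom_filter(f2, 10)

print(f1.reads, f2.reads)  # prints "1 2"
```

Both paths return the same bitset; the difference is that the reader with the length issues one read instead of two dependent ones, which is what makes the access easier to schedule.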