[ 
https://issues.apache.org/jira/browse/PARQUET-2373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17789930#comment-17789930
 ] 

ASF GitHub Bot commented on PARQUET-2373:
-----------------------------------------

mapleFU commented on PR #1184:
URL: https://github.com/apache/parquet-mr/pull/1184#issuecomment-1827228087

   FYI, I've updated a BloomFilter with length for testing: 
https://github.com/apache/parquet-testing/pull/43




> Improve I/O performance with bloom_filter_length
> ------------------------------------------------
>
>                 Key: PARQUET-2373
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2373
>             Project: Parquet
>          Issue Type: Improvement
>            Reporter: Jiashen Zhang
>            Priority: Minor
>
> The spec change in PARQUET-2257 added bloom_filter_length so that readers 
> can load the bloom filter in a single shot. This change uses the 
> 'bloom_filter_length' field to load the whole bloom filter (header and 
> bitset) in one read, which improves I/O scheduling.
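
As a rough illustration of the idea in the description above, the sketch below
compares a two-step read (header first, then bitset) with a single read that
relies on a known total length. It is not the parquet-mr code: CountingFile,
read_without_length, read_with_length and the 4-byte length header are all
hypothetical stand-ins (the real bloom filter header is Thrift-encoded).

```python
import io
import struct

# Assumption for illustration only: a fixed 4-byte little-endian header that
# stores the bitset size. The real Parquet header is Thrift-encoded.
HEADER_SIZE = 4


class CountingFile:
    """Hypothetical file wrapper that counts positioned reads (I/O requests)."""

    def __init__(self, data):
        self._f = io.BytesIO(data)
        self.reads = 0

    def read_at(self, offset, length):
        self.reads += 1
        self._f.seek(offset)
        return self._f.read(length)


def read_without_length(f, offset):
    """Total length unknown: read the header, then issue a second read for the bitset."""
    header = f.read_at(offset, HEADER_SIZE)
    (num_bytes,) = struct.unpack("<I", header)
    return f.read_at(offset + HEADER_SIZE, num_bytes)


def read_with_length(f, offset, bloom_filter_length):
    """Total length known: fetch header and bitset together in one read."""
    blob = f.read_at(offset, bloom_filter_length)
    (num_bytes,) = struct.unpack("<I", blob[:HEADER_SIZE])
    return blob[HEADER_SIZE:HEADER_SIZE + num_bytes]


# Toy "file": 10 bytes of padding, then the header, then a 32-byte bitset.
bitset = bytes(range(32))
data = b"\x00" * 10 + struct.pack("<I", len(bitset)) + bitset

two_step = CountingFile(data)
one_shot = CountingFile(data)
read_without_length(two_step, 10)
read_with_length(one_shot, 10, HEADER_SIZE + len(bitset))
print(two_step.reads, one_shot.reads)
```

In this toy model the known length saves one round trip per bloom filter; how
much that matters in practice would depend on the storage backend.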



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
