[ https://issues.apache.org/jira/browse/PARQUET-41?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16331898#comment-16331898 ]
ASF GitHub Bot commented on PARQUET-41: --------------------------------------- cjjnjust opened a new pull request #432: PARQUET-41: Add bloom filter for parquet URL: https://github.com/apache/parquet-cpp/pull/432 This is first part of bloom filter patch set, which include a bloom filter utility and also some unit tests. Note that this patch also includes murmur3Hash original code from Austin Appleby. The code isn't formatted as parquet-cpp format. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: us...@infra.apache.org > Add bloom filters to parquet statistics > --------------------------------------- > > Key: PARQUET-41 > URL: https://issues.apache.org/jira/browse/PARQUET-41 > Project: Parquet > Issue Type: New Feature > Components: parquet-format, parquet-mr > Reporter: Alex Levenson > Assignee: Ferdinand Xu > Priority: Major > Labels: filter2 > > For row groups with no dictionary, we could still produce a bloom filter. > This could be very useful in filtering entire row groups. > Pull request: > https://github.com/apache/parquet-mr/pull/215 -- This message was sent by Atlassian JIRA (v7.6.3#76005)