[
https://issues.apache.org/jira/browse/IMPALA-3741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17109643#comment-17109643
]
ASF subversion and git services commented on IMPALA-3741:
---------------------------------------------------------
Commit f7770f1d203e1edc24c5f147f4cc4bb56c3e34fc in impala's branch
refs/heads/master from wzhou-code
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=f7770f1 ]
IMPALA-3741 [part 1]: Upgraded kudu/util for BloomFilter
Ported BlockBloomFilter related source files from Kudu upstream
to Impala be/src/kudu/util. The git hash of Kudu to take these
files is 389d4f1e1c.
Testing:
Passed core tests.
Change-Id: Ifac41ffb3e1742ffb6a969cb1c368d6d93c23357
Reviewed-on: http://gerrit.cloudera.org:8080/15676
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>
> Push bloom filters to Kudu scanners
> -----------------------------------
>
> Key: IMPALA-3741
> URL: https://issues.apache.org/jira/browse/IMPALA-3741
> Project: IMPALA
> Issue Type: Task
> Components: Backend
> Affects Versions: Kudu_Impala
> Reporter: Matthew Jacobs
> Assignee: Wenzhe Zhou
> Priority: Major
> Labels: kudu, performance
>
> Impala relies on bloom filters to reduce number of rows from coming out of
> the scan node for selective joins.
> Queries get up to 20x speedup, not having bloom filter support in Kudu will
> create a big performance gap between Parquet and Kudu.
> https://github.com/cloudera/Impala/blob/cdh5-trunk/be/src/util/bloom-filter.h
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]