Subhajit Sinha created HIVE-22528: ------------------------------------- Summary: Bloom Filter not showing up in Explain plan Key: HIVE-22528 URL: https://issues.apache.org/jira/browse/HIVE-22528 Project: Hive Issue Type: Bug Components: Hive Affects Versions: 3.1.0 Environment: Test Environment. Reporter: Subhajit Sinha
Hi Team, We are using Hive version (Apache Hive (version 3.1.0.3.1.0.0-78) and trying to implement Bloom filter in it. So basically I have created a managed table with table properties defined as: 'orc.bloom.filter.columns'='*******', 'orc.bloom.filter.fpp'='0.05', 'orc.stripe.size'='268435456', and stored it as orc file. While checking the explain plan(running: explain select count(1) from the_table where <condition>) in the current Hive version, I couldn't see anything as "Bloom_Filter" in the Plan provided by the CBO. The table I'm querying data in has records. I have a few doubts: # Is Hive 3.1 version not using Bloom filter? If so, I have queried a normal table with same query and condition have seen that it takes more time compared to a table having Bloom filter defined on the column that has condition. # Is there any parameter that needs to be set to get the value/ Bloom filter in the table? # I have come across three parameters, please let me know what does these signify : h5. hive.tez.max.bloom.filter.entries,hive.tez.min.bloom.filter.entries,hive.tez.bloom.filter.factor Please let me know if anyone has used Bloom filter. Let me know then the process -- This message was sent by Atlassian Jira (v8.3.4#803005)