Subhajit Sinha created HIVE-22528:
-------------------------------------

             Summary: Bloom Filter not showing up in Explain plan
                 Key: HIVE-22528
                 URL: https://issues.apache.org/jira/browse/HIVE-22528
             Project: Hive
          Issue Type: Bug
          Components: Hive
    Affects Versions: 3.1.0
         Environment: Test Environment.
            Reporter: Subhajit Sinha


Hi Team,

We are using Hive version (Apache Hive (version 3.1.0.3.1.0.0-78) and trying to 
implement Bloom filter in it. So basically I have created a managed table with 
table properties defined as:

'orc.bloom.filter.columns'='*******',  'orc.bloom.filter.fpp'='0.05',  
'orc.stripe.size'='268435456',

and stored it as orc file. While checking the explain plan(running: explain 
select count(1) from the_table where <condition>) in the current Hive version, 
I couldn't see anything as "Bloom_Filter" in the Plan provided by the CBO. The 
table I'm querying data in has  records.

 

I have a few doubts:
 # Is Hive 3.1 version not using Bloom filter? If so, I have queried a normal 
table with same query and condition have seen that it takes more time compared 
to a table having Bloom filter defined on the column that has condition.
 # Is there any parameter that needs to be set to get the value/ Bloom filter 
in the table?
 # I have come across three parameters, please let me know what does these 
signify : 
h5. 
hive.tez.max.bloom.filter.entries,hive.tez.min.bloom.filter.entries,hive.tez.bloom.filter.factor

Please let me know if anyone has used Bloom filter. Let me know then the process



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to