optimize JDBM to make mapjoin faster
------------------------------------

                 Key: HIVE-1702
                 URL: https://issues.apache.org/jira/browse/HIVE-1702
             Project: Hadoop Hive
          Issue Type: Improvement
          Components: Query Processor
            Reporter: Namit Jain
            Assignee: Liyin Tang


Htree.get() cost 70% total time. It could help a lot if there is bloom filter 
here to avoid unneeded get() if we know for sure the given key is not in JDBM. 
(we can generate the bloom filter when doing the jdbm sink, and read into 
memory when doing read. )

Copied from https://issues.apache.org/jira/browse/HIVE-1700

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to