optimize JDBM to make mapjoin faster
------------------------------------
Key: HIVE-1702
URL: https://issues.apache.org/jira/browse/HIVE-1702
Project: Hadoop Hive
Issue Type: Improvement
Components: Query Processor
Reporter: Namit Jain
Assignee: Liyin Tang
Htree.get() cost 70% total time. It could help a lot if there is bloom filter
here to avoid unneeded get() if we know for sure the given key is not in JDBM.
(we can generate the bloom filter when doing the jdbm sink, and read into
memory when doing read. )
Copied from https://issues.apache.org/jira/browse/HIVE-1700
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.