[
https://issues.apache.org/jira/browse/HIVE-15788?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15980780#comment-15980780
]
Prasanth Jayachandran commented on HIVE-15788:
----------------------------------------------
Last time when I did JMH benchmark RoaringBitmap wasn't fast enough and that's
the reason why I did not use that in first place. I still doubt that this will
be faster. I am sure this will have better compression but it will have huge
performance hit. I guess [~jdere] also benchmarked it recently and concluded
the same (correct me if I am wrong). So I am -1 on replacing the default long[]
until proven otherwise.
> Implement FastBloomFilter to use RoaringBitmap instead of long[]
> -----------------------------------------------------------------
>
> Key: HIVE-15788
> URL: https://issues.apache.org/jira/browse/HIVE-15788
> Project: Hive
> Issue Type: Improvement
> Components: UDF
> Reporter: Gopal V
> Assignee: Murali Vemulapati
> Attachments: HIVE-15788.patch
>
>
> Currently, a bloom filter which is all 1s occupies the exact amount of space
> as a bloom filter which is sparse.
> This is an entire waste of space and produces memory pressure and generate a
> massive number of cache misses.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)