[
https://issues.apache.org/jira/browse/HADOOP-10010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13783117#comment-13783117
]
Xiangrui Meng commented on HADOOP-10010:
----------------------------------------
Assume that a hash function selects each array position with equal probability.
Then for an element not in the collection, the probability that the bloom
filter returns true is
(numTrueBits/numBits)^numHashes
See http://en.wikipedia.org/wiki/Bloom_filter
I don't know what tests would be appropriate here.
> Add expectedFalsePositiveProbability to BloomFilter
> ---------------------------------------------------
>
> Key: HADOOP-10010
> URL: https://issues.apache.org/jira/browse/HADOOP-10010
> Project: Hadoop Common
> Issue Type: New Feature
> Reporter: Xiangrui Meng
> Attachments: fpp.patch
>
> Original Estimate: 24h
> Remaining Estimate: 24h
>
> It would be nice to see the expected false positive probability of a bloom
> filter instance to check its quality. This is a simple function but needs
> access to BloomFilter#bits.
--
This message was sent by Atlassian JIRA
(v6.1#6144)