[ 
https://issues.apache.org/jira/browse/HADOOP-10010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13783117#comment-13783117
 ] 

Xiangrui Meng commented on HADOOP-10010:
----------------------------------------

Assume that a hash function selects each array position with equal probability. 
Then for an element not in the collection, the probability that the bloom 
filter returns true is

(numTrueBits/numBits)^numHashes

See http://en.wikipedia.org/wiki/Bloom_filter

I don't know what tests would be appropriate here.

> Add expectedFalsePositiveProbability to BloomFilter
> ---------------------------------------------------
>
>                 Key: HADOOP-10010
>                 URL: https://issues.apache.org/jira/browse/HADOOP-10010
>             Project: Hadoop Common
>          Issue Type: New Feature
>            Reporter: Xiangrui Meng
>         Attachments: fpp.patch
>
>   Original Estimate: 24h
>  Remaining Estimate: 24h
>
> It would be nice to see the expected false positive probability of a bloom 
> filter instance to check its quality. This is a simple function but needs 
> access to BloomFilter#bits.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

Reply via email to