[ 
https://issues.apache.org/jira/browse/CASSANDRA-4303?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13287349#comment-13287349
 ] 

Daniel Doubleday commented on CASSANDRA-4303:
---------------------------------------------

Out of curiosity because I already started experimenting in this direction:

Has anybody benchmarked the option to keep bloom filters on disk. I did some 
simple tests and could read around 6M random ints / sec from a paged-in 
SegmentedFile. My line of thinking was: why not letting the os decide where the 
memory should be allocated because its a tradeoff sst vs other data kept in mem 
anyways.

I'm not close to a patch to do some real world testing so maybe you can save me 
from the effort because someone else tried and learned that it does not make 
sense.
                
> Compressed bloomfilters
> -----------------------
>
>                 Key: CASSANDRA-4303
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-4303
>             Project: Cassandra
>          Issue Type: Improvement
>            Reporter: Brandon Williams
>             Fix For: 1.2
>
>
> Very commonly, people encountering an OOM need to increase their bloom filter 
> false positive ratio to reduce memory pressure, since BFs tend to be the 
> largest shareholder.  It would make sense if we could alleviate the memory 
> pressure from BFs with compression while maintaining the FP ratio (at the 
> cost of a bit of cpu) that some users have come to expect.  One possible 
> implementation is at http://code.google.com/p/javaewah/

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to