Aaron,

What version are you using ?

1.1.9

Have you changed the bf_ chance ? The sstables need to be rebuilt for it to 
take effect.

I did ( several times ) and I ran upgradesstables after

Not sure what this means.
Are you saying it's in a boat on a river, with tangerine trees and marmalade 
skies ?

You nailed it. A significant number of reads are done from hundreds of sstables 
( I have to add, compaction is apparently constantly 6000-7000 tasks behind and 
the vast majority of the reads access recently written data )

Take a look at the nodetool cfhistograms to get a better idea of the row size 
and use that info when considering the sstable size.

It's around 1-20K. What should I optimise the LCS sstable size for? I suppose 
"I want to fit as many complete rows as possible into a single sstable to keep 
the file count down while avoiding compactions of oversized ( double digit 
gigabytes? ) sstables at higher levels" ?
Do I have to run a major compaction after a change to sstable_size_in_mb ? The 
larger sstable size wouldn't really affect sstables on levels above L0 , would 
it?



Thanks!!
Andras


From: aaron morton <aa...@thelastpickle.com>
Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Date: Tuesday 26 March 2013 21:46
To: "user@cassandra.apache.org" <user@cassandra.apache.org>
Subject: Re: bloom filter fp ratio of 0.98 with fp_chance of 0.01

What version are you using ?
1.2.0 allowed a null bf chance, and I think it returned .1 for LCS and .01 for 
STS compaction.
Have you changed the bf_ chance ? The sstables need to be rebuilt for it to 
take effect.

and sstables read is in the skies
Not sure what this means.
Are you saying it's in a boat on a river, with tangerine trees and marmalade 
skies ?

                SSTable count: 22682
Lots of files there, I imagine this would dilute the effectiveness of the key 
cache. It's caching (sstable, key) tuples.
You may want to look at increasing the sstable_size with LCS.
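
To illustrate the point about (sstable, key) tuples, here is a toy sketch, not Cassandra's actual key cache. The class `ToyKeyCache`, the function `hit_ratio`, and all the sizes are made up for illustration. It only shows that when each read touches many sstables, the same row key occupies many cache slots and a fixed-size LRU cache stops hitting.

```python
from collections import OrderedDict


class ToyKeyCache:
    """Hypothetical LRU cache keyed by (sstable_id, row_key) tuples."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()
        self.hits = 0
        self.misses = 0

    def lookup(self, sstable_id, row_key):
        cache_key = (sstable_id, row_key)
        if cache_key in self.entries:
            # Refresh the entry's position so it is evicted last.
            self.entries.move_to_end(cache_key)
            self.hits += 1
            return True
        self.misses += 1
        self.entries[cache_key] = True
        if len(self.entries) > self.capacity:
            # Evict the least recently used entry.
            self.entries.popitem(last=False)
        return False


def hit_ratio(sstables_per_read, distinct_keys=100, capacity=200, rounds=5):
    """Read the same keys repeatedly, touching several sstables per read."""
    cache = ToyKeyCache(capacity)
    for _ in range(rounds):
        for key in range(distinct_keys):
            for sstable_id in range(sstables_per_read):
                cache.lookup(sstable_id, key)
    return cache.hits / (cache.hits + cache.misses)


if __name__ == "__main__":
    for n in (1, 2, 50):
        print(f"{n:3d} sstables per read -> hit ratio {hit_ratio(n):.2f}")
```

In this toy model the cache holds 200 entries: with one sstable per read the 100 keys fit comfortably, while with 50 sstables per read each pass needs 5000 entries and nothing survives long enough to be hit again.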

                Compacted row minimum size: 104
                Compacted row maximum size: 263210
                Compacted row mean size: 3041
Take a look at the nodetool cfhistograms to get a better idea of the row size 
and use that info when considering the sstable size.
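
As a rough back-of-the-envelope sketch of that sizing exercise, the snippet below uses only numbers from the original post (Space used (live), Compacted row mean size). The function name `estimate` and the candidate sizes other than 5 are hypothetical. It ignores compression, per-file overhead and the exact way LCS applies the size target, so treat the results as orders of magnitude only.

```python
import math

# Figures taken from the cfstats output in the original post.
SPACE_USED_LIVE_BYTES = 119771434915
MEAN_ROW_BYTES = 3041
MB = 1024 * 1024


def estimate(sstable_size_in_mb):
    """Return (approximate sstable count, approximate mean-sized rows per sstable)."""
    target_bytes = sstable_size_in_mb * MB
    sstable_count = math.ceil(SPACE_USED_LIVE_BYTES / target_bytes)
    rows_per_sstable = target_bytes // MEAN_ROW_BYTES
    return sstable_count, rows_per_sstable


if __name__ == "__main__":
    # 5 is the configured value; the other sizes are arbitrary candidates.
    for size_mb in (5, 10, 50, 100):
        count, rows = estimate(size_mb)
        print(f"sstable_size_in_mb={size_mb:4d}: ~{count} sstables, ~{rows} rows each")
```

For the configured 5 MB this lands close to the reported SSTable count of 22682, which suggests the file count is mostly a consequence of the small size target.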

Cheers
-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 26/03/2013, at 6:16 AM, Andras Szerdahelyi 
<andras.szerdahe...@ignitionone.com> wrote:

Hello list,

Could anyone shed some light on how an FP chance of 0.01 can coexist with a 
measured FP ratio of 0.98 ? Am I reading this wrong, or are 98% of the 
requests hitting the bloom filter producing a false positive while the "target" 
false ratio is 0.01?
( Also, the key cache hit ratio is around 0.001 and sstables read is in the skies ( 
non-exponential (non-) drop-off for LCS ), but that should be filed under 
"effect" and not "cause"? )

[default@unknown] use KS;
Authenticated to keyspace: KS
[default@KS] describe CF;
    ColumnFamily: CF
      Key Validation Class: org.apache.cassandra.db.marshal.BytesType
      Default column value validator: org.apache.cassandra.db.marshal.BytesType
      Columns sorted by: org.apache.cassandra.db.marshal.BytesType
      GC grace seconds: 691200
      Compaction min/max thresholds: 4/32
      Read repair chance: 0.1
      DC Local Read repair chance: 0.0
      Replicate on write: true
      Caching: ALL
      Bloom Filter FP chance: 0.01
      Built indexes: []
      Compaction Strategy: org.apache.cassandra.db.compaction.LeveledCompactionStrategy
      Compaction Strategy Options:
        sstable_size_in_mb: 5
      Compression Options:
        sstable_compression: org.apache.cassandra.io.compress.SnappyCompressor

Keyspace: KS
        Read Count: 628950
        Read Latency: 93.19921121869784 ms.
        Write Count: 1219021
        Write Latency: 0.14352380885973254 ms.
        Pending Tasks: 0
                Column Family: CF
                SSTable count: 22682
                Space used (live): 119771434915
                Space used (total): 119771434915
                Number of Keys (estimate): 203837952
                Memtable Columns Count: 13125
                Memtable Data Size: 33212827
                Memtable Switch Count: 15
                Read Count: 629009
                Read Latency: 88.434 ms.
                Write Count: 1219038
                Write Latency: 0.095 ms.
                Pending Tasks: 0
                Bloom Filter False Positives: 37939419
                Bloom Filter False Ratio: 0.97928
                Bloom Filter Space Used: 261572784
                Compacted row minimum size: 104
                Compacted row maximum size: 263210
                Compacted row mean size: 3041
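
A small sanity check on the figures above, under one stated assumption that the thread does not confirm: that "Bloom Filter False Ratio" is computed as false positives / (false positives + true positives), i.e. true negatives are not counted. If that holds, the ratio is not directly comparable to the per-filter fp chance, because a read that consults many filters can collect many false positives for every true positive. The names below are illustrative only.

```python
# Figures taken from the cfstats output above.
FALSE_POSITIVES = 37939419
REPORTED_RATIO = 0.97928
READ_COUNT = 629009
FP_CHANCE = 0.01  # configured Bloom Filter FP chance


def false_ratio(false_positives, true_positives):
    """ASSUMED definition: FP / (FP + TP); true negatives are ignored."""
    total = false_positives + true_positives
    return false_positives / total if total else 0.0


# True positives implied by the reported ratio under the assumed definition.
implied_true_positives = FALSE_POSITIVES * (1 - REPORTED_RATIO) / REPORTED_RATIO

# Average false positives charged to each read.
fp_per_read = FALSE_POSITIVES / READ_COUNT

# If every filter really met the 0.01 target, this many filters would have
# to be consulted per read to produce that many false positives.
implied_filters_per_read = fp_per_read / FP_CHANCE

if __name__ == "__main__":
    print(f"implied true positives:     {implied_true_positives:,.0f}")
    print(f"false positives per read:   {fp_per_read:.1f}")
    print(f"implied filters per read:   {implied_filters_per_read:,.0f}")
```

Under these assumptions, a ratio near 0.98 would be consistent with each read consulting a very large number of sstables rather than with the filters missing their 0.01 target, but the numbers alone cannot distinguish the two.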

I upgraded sstables after changing the FP chance

Thanks!
Andras
