- Cassandra 2.1.13
- SSDs
- LeveledCompactionStrategy
- Range repair (not incremental) with Spotify's Reaper (https://github.com/spotify/cassandra-reaper)

Problem: When we run a repair job, the SSTable count sometimes goes to 10K on one of the nodes (not always the same node). The Reaper is smart enough to postpone the repair on this node since the number of pending compactions is > 20, but the number of SSTables stays around 10K. Even if I set the compaction throughput to 0 (throttling disabled), the SSTable count stays around 10K.

Workaround: If we abort the repair and restart the node, it quickly (within 15 minutes) goes back to 200 SSTables.

Any suggestions as to what I should look at?

When it occurs, I've noticed that nodetool compactionstats and cfstats (on the table with 10K SSTables) take minutes to return a result.

I thought that the issue might be related to https://issues.apache.org/jira/browse/CASSANDRA-10766, as I see MemtablePostFlush waiting on the countdown latch, but according to tpstats the Pending count for MemtablePostFlush goes up and down.

Complete stack trace: http://pastebin.com/K1r3CUff

I took some tpstats samples (roughly every minute). Only these pools are not at 0 (Active/Pending):

Pool Name              Active  Pending  Completed  Blocked  All time blocked
MemtableFlushWriter         2        2     139864        0                 0
MemtablePostFlush           1       13     223714        0                 0
CompactionExecutor         10       10     804964        0                 0

MemtableFlushWriter         4        4     139889        0                 0
MemtablePostFlush           1       12     223744        0                 0
CompactionExecutor         12       12     805365        0                 0

MemtableFlushWriter         5        5     139896        0                 0
MemtablePostFlush           1       10     223755        0                 0
CompactionExecutor          9        9     805503        0                 0

MemtableFlushWriter         4        4     139907        0                 0
MemtablePostFlush           1       13     223762        0                 0
CompactionExecutor          9        9     805703        0                 0

MemtableFlushWriter         5        5     139927        0                 0
MemtablePostFlush           1       14     223783        0                 0
CompactionExecutor         10       10     805971        0                 0

MemtableFlushWriter         7        7     139956        0                 0
MemtablePostFlush           1       23     223806        0                 0
CompactionExecutor         10       10     806428        0                 0

nodetool compactionstats shows pending tasks: 66

nodetool cfstats output:

Keyspace: foo
    Read Count: 6308735
    Read Latency: 12.132909585836147 ms.
    Write Count: 15394697
    Write Latency: 0.09054346675351908 ms.
    Pending Flushes: 15
        Table: bar
        SSTable count: 10326
        SSTables in each level: [10090/4, 10, 106/100, 112, 0, 0, 0, 0, 0]
        Space used (live): 69204087872
        Space used (total): 69206400092
        Space used by snapshots (total): 2708047105
        Off heap memory used (total): 35230672
        SSTable Compression Ratio: 0.339043411676821
        Number of keys (estimate): 1601158
        Memtable cell count: 86524
        Memtable data size: 6508214
        Memtable off heap memory used: 0
        Memtable switch count: 22719
        Local read count: 6310549
        Local read latency: 12.135 ms
        Local write count: 15397653
        Local write latency: 0.091 ms
        Pending flushes: 10
        Bloom filter false positives: 2282107
        Bloom filter false ratio: 0.38494
        Bloom filter space used: 3244792
        Bloom filter off heap memory used: 3162168
        Index summary off heap memory used: 3348360
        Compression metadata off heap memory used: 28720144
        Compacted partition minimum bytes: 87
        Compacted partition maximum bytes: 2816159
        Compacted partition mean bytes: 69860
        Average live cells per slice (last five minutes): 817.6059838850788
        Maximum live cells per slice (last five minutes): 5002.0
        Average tombstones per slice (last five minutes): 0.0
        Maximum tombstones per slice (last five minutes): 0.0

Thanks
J-F Gosselin
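
For anyone reading the cfstats numbers above: the "SSTables in each level" line can be checked mechanically. The following is only a rough Python sketch, assuming that cfstats prints an entry as count/limit solely for levels that are over their target size (which is how the 2.1 output above appears to behave). The function name overfull_levels is made up for illustration and is not part of any Cassandra tooling.

```python
import re


def overfull_levels(cfstats_line):
    """Return {level: (count, limit)} for entries printed as count/limit.

    Assumption: a plain number means the level is within its limit,
    while "count/limit" means the level currently exceeds it.
    """
    inside = re.search(r"\[(.*?)\]", cfstats_line).group(1)
    result = {}
    for level, entry in enumerate(inside.split(",")):
        entry = entry.strip()
        if "/" in entry:
            count, limit = entry.split("/")
            result[level] = (int(count), int(limit))
    return result


# The line reported by cfstats for table bar in this post.
line = "SSTables in each level: [10090/4, 10, 106/100, 112, 0, 0, 0, 0, 0]"
over = overfull_levels(line)
for level, (count, limit) in sorted(over.items()):
    print(f"L{level}: {count} SSTables, limit {limit}")
```

On the line from this post it reports L0 (10090 against a limit of 4) and L2 (106 against a limit of 100), i.e. almost all of the extra SSTables are sitting in L0 waiting to be compacted into the higher levels.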