Did you try checking (using nodetool getsstables) how many SSTables your
row's data are spread into? All the "parts" of the row have to be in one
SSTable to remove it (data & tombstone). Remember, that even if you do
not update your data, you still may have two SSTables containing row's
data (one had "real" data, second one contains tombstone).
Nodetool cleanup & restarting C* will definitely not help in this case.
One more thing you can try is forceUserDefinedCompaction called on
single SSTable via JMX (or on two SSTables - data & tombstone ones - if
you can figure out "where" your data "lies" ;-) ). If you do and it
works, and you run scheduler repairs, keep this:
https://issues.apache.org/jira/browse/CASSANDRA-4905 in mind.
M.
W dniu 04.03.2013 08:07, Matthias Zeilinger pisze:
Hi,
I´m running Cassandra 1.1.5 and have following issue.
I´m using a 10 days TTL on my CF. I can see a lot of tombstones in there, but
they aren´t deleted after compaction.
I have tried a nodetool -cleanup and also a restart of Cassandra, but nothing
happened.
total 61G
drwxr-xr-x 2 cassandra dba 20K Mar 4 06:35 .
drwxr-xr-x 10 cassandra dba 4.0K Dec 10 13:05 ..
-rw-r--r-- 1 cassandra dba 15M Dec 15 22:04
whatever-he-1398-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 19G Dec 15 22:04 whatever-he-1398-Data.db
-rw-r--r-- 1 cassandra dba 15M Dec 15 22:04 whatever-he-1398-Filter.db
-rw-r--r-- 1 cassandra dba 357M Dec 15 22:04 whatever-he-1398-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Dec 15 22:04 whatever-he-1398-Statistics.db
-rw-r--r-- 1 cassandra dba 9.5M Feb 6 15:45
whatever-he-5464-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 12G Feb 6 15:45 whatever-he-5464-Data.db
-rw-r--r-- 1 cassandra dba 48M Feb 6 15:45 whatever-he-5464-Filter.db
-rw-r--r-- 1 cassandra dba 736M Feb 6 15:45 whatever-he-5464-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Feb 6 15:45 whatever-he-5464-Statistics.db
-rw-r--r-- 1 cassandra dba 9.7M Feb 21 19:13
whatever-he-6829-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 12G Feb 21 19:13 whatever-he-6829-Data.db
-rw-r--r-- 1 cassandra dba 47M Feb 21 19:13 whatever-he-6829-Filter.db
-rw-r--r-- 1 cassandra dba 792M Feb 21 19:13 whatever-he-6829-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Feb 21 19:13 whatever-he-6829-Statistics.db
-rw-r--r-- 1 cassandra dba 3.7M Mar 1 10:46
whatever-he-7578-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 4.3G Mar 1 10:46 whatever-he-7578-Data.db
-rw-r--r-- 1 cassandra dba 12M Mar 1 10:46 whatever-he-7578-Filter.db
-rw-r--r-- 1 cassandra dba 274M Mar 1 10:46 whatever-he-7578-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 1 10:46 whatever-he-7578-Statistics.db
-rw-r--r-- 1 cassandra dba 3.6M Mar 1 11:21
whatever-he-7582-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 4.3G Mar 1 11:21 whatever-he-7582-Data.db
-rw-r--r-- 1 cassandra dba 9.7M Mar 1 11:21 whatever-he-7582-Filter.db
-rw-r--r-- 1 cassandra dba 236M Mar 1 11:21 whatever-he-7582-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 1 11:21 whatever-he-7582-Statistics.db
-rw-r--r-- 1 cassandra dba 3.7M Mar 3 12:13
whatever-he-7869-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 4.3G Mar 3 12:13 whatever-he-7869-Data.db
-rw-r--r-- 1 cassandra dba 9.8M Mar 3 12:13 whatever-he-7869-Filter.db
-rw-r--r-- 1 cassandra dba 239M Mar 3 12:13 whatever-he-7869-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 3 12:13 whatever-he-7869-Statistics.db
-rw-r--r-- 1 cassandra dba 924K Mar 3 18:02
whatever-he-7953-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 1.1G Mar 3 18:02 whatever-he-7953-Data.db
-rw-r--r-- 1 cassandra dba 2.1M Mar 3 18:02 whatever-he-7953-Filter.db
-rw-r--r-- 1 cassandra dba 51M Mar 3 18:02 whatever-he-7953-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 3 18:02 whatever-he-7953-Statistics.db
-rw-r--r-- 1 cassandra dba 231K Mar 3 20:06
whatever-he-7974-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 268M Mar 3 20:06 whatever-he-7974-Data.db
-rw-r--r-- 1 cassandra dba 483K Mar 3 20:06 whatever-he-7974-Filter.db
-rw-r--r-- 1 cassandra dba 12M Mar 3 20:06 whatever-he-7974-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 3 20:06 whatever-he-7974-Statistics.db
-rw-r--r-- 1 cassandra dba 116K Mar 4 06:28
whatever-he-8002-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 146M Mar 4 06:28 whatever-he-8002-Data.db
-rw-r--r-- 1 cassandra dba 646K Mar 4 06:28 whatever-he-8002-Filter.db
-rw-r--r-- 1 cassandra dba 16M Mar 4 06:28 whatever-he-8002-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 4 06:28 whatever-he-8002-Statistics.db
-rw-r--r-- 1 cassandra dba 58K Mar 4 06:28
whatever-he-8003-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 67M Mar 4 06:28 whatever-he-8003-Data.db
-rw-r--r-- 1 cassandra dba 105K Mar 4 06:28 whatever-he-8003-Filter.db
-rw-r--r-- 1 cassandra dba 2.5M Mar 4 06:28 whatever-he-8003-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 4 06:28 whatever-he-8003-Statistics.db
-rw-r--r-- 1 cassandra dba 230K Mar 4 06:30
whatever-he-8004-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 261M Mar 4 06:30 whatever-he-8004-Data.db
-rw-r--r-- 1 cassandra dba 480K Mar 4 06:30 whatever-he-8004-Filter.db
-rw-r--r-- 1 cassandra dba 12M Mar 4 06:30 whatever-he-8004-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 4 06:30 whatever-he-8004-Statistics.db
-rw-r--r-- 1 cassandra dba 15K Mar 4 06:30
whatever-he-8005-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 16M Mar 4 06:30 whatever-he-8005-Data.db
-rw-r--r-- 1 cassandra dba 39K Mar 4 06:30 whatever-he-8005-Filter.db
-rw-r--r-- 1 cassandra dba 944K Mar 4 06:30 whatever-he-8005-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 4 06:30 whatever-he-8005-Statistics.db
-rw-r--r-- 1 cassandra dba 5.0K Mar 4 06:35
whatever-he-8006-CompressionInfo.db
-rw-r--r-- 1 cassandra dba 6.7M Mar 4 06:35 whatever-he-8006-Data.db
-rw-r--r-- 1 cassandra dba 81K Mar 4 06:35 whatever-he-8006-Filter.db
-rw-r--r-- 1 cassandra dba 2.0M Mar 4 06:35 whatever-he-8006-Index.db
-rw-r--r-- 1 cassandra dba 4.3K Mar 4 06:35 whatever-he-8006-Statistics.db
The things marked in red, I guess, are the old data, but they aren´t deleted.
As you can see on the date, they are older than 10 days.
Is there any possibility to delete them?
Here is also the schema of the CF:
create column family whatever
with column_type = 'Standard'
and comparator = 'AsciiType'
and default_validation_class = 'AsciiType'
and key_validation_class = 'AsciiType'
and read_repair_chance = 0.0
and dclocal_read_repair_chance = 0.0
and gc_grace = 0
and min_compaction_threshold = 4
and max_compaction_threshold = 32
and replicate_on_write = false
and compaction_strategy =
'org.apache.cassandra.db.compaction.SizeTieredCompactionStrategy'
and caching = 'KEYS_ONLY'
and compression_options = {'sstable_compression' :
'org.apache.cassandra.io.compress.SnappyCompressor'};
Br,
Matthias Zeilinger
Production Operation - Shared Services
P: +43 (0) 50 858-31185
M: +43 (0) 664 85-34459
E: matthias.zeilin...@bwinparty.com
bwin.party services (Austria) GmbH
Marxergasse 1B
A-1030 Vienna
www.bwinparty.com