Tobias Lindaaker created CASSANDRA-18507:
--------------------------------------------
Summary: Partial compaction can resurrect deleted data
Key: CASSANDRA-18507
URL: https://issues.apache.org/jira/browse/CASSANDRA-18507
Project: Cassandra
Issue Type: Bug
Reporter: Tobias Lindaaker
If there isn't enough disk space available to compact all existing sstables,
Cassandra will attempt to perform a partial compaction by removing sstables
from the set of candidate sstables to be compacted, starting with the largest
one. It is possible that the sstable removed from the set of sstables to
compact contains data for which there are tombstones in another (more recent)
sstable. Since the overlaps between sstables is computed when the
{{CompactionController}} is created, and the {{CompactionController}} is
created before the removal of any sstables from the set of sstables to be
compacted this computed overlap will be outdated when checking which sstables
are covered by certain tombstones. This leads to the faulty conclusion that the
tombstones can be pruned during the compaction, causing the data to be
resurrected.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]