[jira] [Commented] (CASSANDRA-8683) Ensure early reopening has no overlap with replaced files

Alan Boudreault (JIRA) Mon, 09 Feb 2015 08:25:30 -0800

    [ 
https://issues.apache.org/jira/browse/CASSANDRA-8683?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14312407#comment-14312407
 ]


Alan Boudreault commented on CASSANDRA-8683:
--------------------------------------------

[~benedict] When using your branch, I'm getting this message when running my 
tests and doing a cleanup:

{code}
Aborted cleaning up atleast one column family in keyspace r1, check server logs 
for more information.
Aborted cleaning up atleast one column family in keyspace r1, check server logs 
for more information.
Aborted cleaning up atleast one column family in keyspace r1, check server logs 
for more information.
{code}

I can't get this outside of your branch. I'm not sure if this is an error or 
something expected with your changes. There are no errors in the logs. I only 
see this:

{code}
WARN  [RMI TCP Connection(24)-127.0.0.1] 2015-02-09 10:56:10,282 
ColumnFamilyStore.java:2587 - Unable to cancel in-progress compactions for 
standard1.  Perhaps there is an unusually large row in progress somewhere, or 
the system is simply overloaded.
INFO  [RMI TCP Connection(24)-127.0.0.1] 2015-02-09 10:56:10,282 
CompactionManager.java:255 - Aborting operation on r1.standard1 after failing 
to interrupt other compaction operations
INFO  [RMI TCP Connection(24)-127.0.0.1] 2015-02-09 10:56:10,282 
CompactionManager.java:260 - No sstables for r1.Counter3
INFO  [RMI TCP Connection(24)-127.0.0.1] 2015-02-09 10:56:10,285 
CompactionManager.java:260 - No sstables for system_traces.sessions
INFO  [RMI TCP Connection(24)-127.0.0.1] 2015-02-09 10:56:10,285 
CompactionManager.java:260 - No sstables for system_traces.events
WARN  [RMI TCP Connection(27)-127.0.0.1] 2015-02-09 10:59:13,550 
ColumnFamilyStore.java:2587 - Unable to cancel in-progress compactions for 
standard1.  Perhaps there is an unusually large row in progress somewhere, or 
the system is simply overloaded.
{code}

Any hint about this? Should I go further in the issue and produce a minimal 
test case to reproduce this message?

> Ensure early reopening has no overlap with replaced files
> ---------------------------------------------------------
>
>                 Key: CASSANDRA-8683
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-8683
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Marcus Eriksson
>            Assignee: Benedict
>            Priority: Critical
>             Fix For: 2.1.3
>
>         Attachments: 0001-avoid-NPE-in-getPositionsForRanges.patch
>
>
> When introducing CASSANDRA-6916 we permitted the early opened files to 
> overlap with the files they were replacing by one DecoratedKey, as this 
> permitted a few minor simplifications. Unfortunately this breaks assumptions 
> in LeveledCompactionScanner, that are causing the intermittent unit test 
> failures: 
> http://cassci.datastax.com/job/trunk_utest/1330/testReport/junit/org.apache.cassandra.db.compaction/LeveledCompactionStrategyTest/testValidationMultipleSSTablePerLevel/
> This patch by itself does not fix the bug, but fixes the described aspect of 
> it, by ensuring the replaced and replacing files never overlap. This is 
> achieved first by always selecting the replaced file start as the next key 
> present in the file greater than the last key in the new file(s).  If there 
> is no such key, however, there is no data to return for the reader, but to 
> permit abort and atomic replacement at the end of a macro compaction action, 
> we must keep the file in the DataTracker for replacement purposes, but not 
> return it to consumers (esp. as many assume a non-empty range). For this I 
> have introduced a new OpenReason called SHADOWED, and a 
> DataTracker.View.shadowed collection of sstables, that tracks those we still 
> consider to be in the live set, but from which we no longer answer any 
> queries.
> CASSANDRA-8744 (and then CASSANDRA-8750) then ensures that these bounds are 
> honoured, so that we never break the assumption that files in LCS never 
> overlap.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (CASSANDRA-8683) Ensure early reopening has no overlap with replaced files

Reply via email to