[jira] [Created] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
david created CASSANDRA-9668:
---------------------------------

            Summary: RepairException when trying to run concurrent repair -pr
                Key: CASSANDRA-9668
                URL: https://issues.apache.org/jira/browse/CASSANDRA-9668
            Project: Cassandra
         Issue Type: Bug
         Components: Core
        Environment: Cassandra 2.1.7
           Reporter: david
           Priority: Critical
            Fix For: 2.1.x

Was on 2.1.3, having very similar issues to those described in https://issues.apache.org/jira/browse/CASSANDRA-9266. I updated to 2.1.7, more for some other fixes, but now if I try to run concurrent repairs (on different boxes) I consistently get:

{noformat}
ERROR [Thread-14156] 2015-06-28 09:33:12,616 StorageService.java:2959 - Repair session b1e67660-1d78-11e5-aec7-4f05493cbe02 for range (-4660677346721084182,-4658765298409301171] failed with error org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127
java.util.concurrent.ExecutionException: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127
	at java.util.concurrent.FutureTask.report(FutureTask.java:122) [na:1.8.0_40]
	at java.util.concurrent.FutureTask.get(FutureTask.java:192) [na:1.8.0_40]
	at org.apache.cassandra.service.StorageService$4.runMayThrow(StorageService.java:2950) ~[apache-cassandra-2.1.7.jar:2.1.7]
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) [apache-cassandra-2.1.7.jar:2.1.7]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40]
	at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40]
	at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40]
Caused by: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127
	at com.google.common.base.Throwables.propagate(Throwables.java:160) ~[guava-16.0.jar:na]
	at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) [apache-cassandra-2.1.7.jar:2.1.7]
	at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40]
	at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40]
	at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40]
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[na:1.8.0_40]
	... 1 common frames omitted
Caused by: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127
	at org.apache.cassandra.repair.RepairSession.validationComplete(RepairSession.java:166) ~[apache-cassandra-2.1.7.jar:2.1.7]
	at org.apache.cassandra.service.ActiveRepairService.handleMessage(ActiveRepairService.java:406) ~[apache-cassandra-2.1.7.jar:2.1.7]
	at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:134) ~[apache-cassandra-2.1.7.jar:2.1.7]
	at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:62) ~[apache-cassandra-2.1.7.jar:2.1.7]
	... 3 common frames omitted
{noformat}

The specific repair command being issued:

{noformat}
nodetool repair -local -pr -inc -par -- keyspace
{noformat}

It's a 15-box environment with a replication factor of 3.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[5/6] cassandra git commit: Merge branch 'cassandra-2.1' into cassandra-2.2
Merge branch 'cassandra-2.1' into cassandra-2.2

Conflicts:
	build.xml

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/02a7c342
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/02a7c342
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/02a7c342

Branch: refs/heads/cassandra-2.2
Commit: 02a7c342922a209ac7374f2f425c783a5faf8538
Parents: 14d7a63 bd4a9d1
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:39:53 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:39:53 2015 +0100

----------------------------------------------------------------------
[3/6] cassandra git commit: backport burn test refactor
backport burn test refactor

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/bd4a9d18
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/bd4a9d18
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/bd4a9d18

Branch: refs/heads/trunk
Commit: bd4a9d18e1317dcb8542bd4adc5a9f99b108d6c6
Parents: 8a56868
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:38:22 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:38:22 2015 +0100

----------------------------------------------------------------------
 build.xml                                        |   7 +
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 +
 .../concurrent/LongSharedExecutorPoolTest.java   | 226 +
 .../apache/cassandra/utils/LongBTreeTest.java    | 502 +++
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 -
 .../concurrent/LongSharedExecutorPoolTest.java   | 228 -
 .../apache/cassandra/utils/LongBTreeTest.java    | 401 ---
 7 files changed, 975 insertions(+), 869 deletions(-)
----------------------------------------------------------------------

http://git-wip-us.apache.org/repos/asf/cassandra/blob/bd4a9d18/build.xml
----------------------------------------------------------------------
diff --git a/build.xml b/build.xml
index 73e76e5..18ad49f 100644
--- a/build.xml
+++ b/build.xml
@@ -93,6 +93,7 @@
     <property name="test.timeout" value="6" />
     <property name="test.long.timeout" value="60" />
+    <property name="test.burn.timeout" value="60" />
     <!-- default for cql tests. Can be override by -Dcassandra.test.use_prepared=false -->
     <property name="cassandra.test.use_prepared" value="true" />
@@ -1258,6 +1259,12 @@
     </testmacro>
   </target>

+  <target name="test-burn" depends="build-test" description="Execute functional tests">
+    <testmacro suitename="burn" inputdir="${test.burn.src}"
+               timeout="${test.burn.timeout}">
+    </testmacro>
+  </target>
+
   <target name="long-test" depends="build-test" description="Execute functional tests">
     <testmacro suitename="long" inputdir="${test.long.src}"
                timeout="${test.long.timeout}">

http://git-wip-us.apache.org/repos/asf/cassandra/blob/bd4a9d18/test/burn/org/apache/cassandra/concurrent/LongOpOrderTest.java
----------------------------------------------------------------------
diff --git a/test/burn/org/apache/cassandra/concurrent/LongOpOrderTest.java b/test/burn/org/apache/cassandra/concurrent/LongOpOrderTest.java
new file mode 100644
index 000..d7105df
--- /dev/null
+++ b/test/burn/org/apache/cassandra/concurrent/LongOpOrderTest.java
@@ -0,0 +1,240 @@
+package org.apache.cassandra.concurrent;
+/*
+ *
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ *   http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing,
+ * software distributed under the License is distributed on an
+ * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
+ * KIND, either express or implied.  See the License for the
+ * specific language governing permissions and limitations
+ * under the License.
+ *
+ */
+
+import java.util.Map;
+import java.util.concurrent.ExecutorService;
+import java.util.concurrent.Executors;
+import java.util.concurrent.ScheduledExecutorService;
+import java.util.concurrent.ThreadLocalRandom;
+import java.util.concurrent.TimeUnit;
+import java.util.concurrent.atomic.AtomicInteger;
+
+import org.cliffc.high_scale_lib.NonBlockingHashMap;
+import org.junit.Test;
+import org.slf4j.Logger;
+import org.slf4j.LoggerFactory;
+
+import org.apache.cassandra.utils.concurrent.OpOrder;
+
+import static org.junit.Assert.assertTrue;
+
+// TODO: we don't currently test SAFE functionality at all!
+// TODO: should also test markBlocking and SyncOrdered
+public class LongOpOrderTest
+{
+    private static final Logger logger = LoggerFactory.getLogger(LongOpOrderTest.class);
+
+    static final int CONSUMERS = 4;
+    static final int PRODUCERS = 32;
+
+    static final long RUNTIME = TimeUnit.MINUTES.toMillis(5);
+    static final long REPORT_INTERVAL = TimeUnit.MINUTES.toMillis(1);
+
+    static final Thread.UncaughtExceptionHandler handler = new Thread.UncaughtExceptionHandler()
+    {
+        @Override
+        public void uncaughtException(Thread t, Throwable e)
+        {
+            System.err.println(t.getName() + ": " + e.getMessage());
+            e.printStackTrace();
+        }
+    };
+
+    final OpOrder order = new OpOrder();
[1/6] cassandra git commit: backport burn test refactor
Repository: cassandra

Updated Branches:
  refs/heads/cassandra-2.1 8a56868bc -> bd4a9d18e
  refs/heads/cassandra-2.2 14d7a63b8 -> 02a7c3429
  refs/heads/trunk 6739434c6 -> 3671082b0

backport burn test refactor

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/bd4a9d18
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/bd4a9d18
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/bd4a9d18

Branch: refs/heads/cassandra-2.1
Commit: bd4a9d18e1317dcb8542bd4adc5a9f99b108d6c6
Parents: 8a56868
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:38:22 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:38:22 2015 +0100

----------------------------------------------------------------------
 build.xml                                        |   7 +
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 +
 .../concurrent/LongSharedExecutorPoolTest.java   | 226 +
 .../apache/cassandra/utils/LongBTreeTest.java    | 502 +++
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 -
 .../concurrent/LongSharedExecutorPoolTest.java   | 228 -
 .../apache/cassandra/utils/LongBTreeTest.java    | 401 ---
 7 files changed, 975 insertions(+), 869 deletions(-)
----------------------------------------------------------------------
[2/6] cassandra git commit: backport burn test refactor
backport burn test refactor

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/bd4a9d18
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/bd4a9d18
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/bd4a9d18

Branch: refs/heads/cassandra-2.2
Commit: bd4a9d18e1317dcb8542bd4adc5a9f99b108d6c6
Parents: 8a56868
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:38:22 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:38:22 2015 +0100

----------------------------------------------------------------------
 build.xml                                        |   7 +
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 +
 .../concurrent/LongSharedExecutorPoolTest.java   | 226 +
 .../apache/cassandra/utils/LongBTreeTest.java    | 502 +++
 .../cassandra/concurrent/LongOpOrderTest.java    | 240 -
 .../concurrent/LongSharedExecutorPoolTest.java   | 228 -
 .../apache/cassandra/utils/LongBTreeTest.java    | 401 ---
 7 files changed, 975 insertions(+), 869 deletions(-)
----------------------------------------------------------------------
[4/6] cassandra git commit: Merge branch 'cassandra-2.1' into cassandra-2.2
Merge branch 'cassandra-2.1' into cassandra-2.2

Conflicts:
	build.xml

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/02a7c342
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/02a7c342
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/02a7c342

Branch: refs/heads/trunk
Commit: 02a7c342922a209ac7374f2f425c783a5faf8538
Parents: 14d7a63 bd4a9d1
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:39:53 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:39:53 2015 +0100

----------------------------------------------------------------------
[6/6] cassandra git commit: Merge branch 'cassandra-2.2' into trunk
Merge branch 'cassandra-2.2' into trunk

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/3671082b
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/3671082b
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/3671082b

Branch: refs/heads/trunk
Commit: 3671082b037c05979740c9bc5a4ee3a4a4425bf7
Parents: 6739434 02a7c34
Author: Benedict Elliott Smith <bened...@apache.org>
Authored: Sun Jun 28 11:40:00 2015 +0100
Committer: Benedict Elliott Smith <bened...@apache.org>
Committed: Sun Jun 28 11:40:00 2015 +0100

----------------------------------------------------------------------
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604662#comment-14604662 ]

Benedict commented on CASSANDRA-9318:
-------------------------------------

I'm pretty sure I've made clear a few times that I'm proposing load shedding based on _both_ resource consumption and timeout; i.e. if we are running out of resources, we hint, and if we completely run out of resources, we shed. In this case, shedding is _never_ incapable of keeping us in a happy place, and ensures we absolutely prevent any spam bringing down the server.

I think we need to really separate the two concerns, as we seem to be jumping between them: keeping the server alive is best done through shedding; helping users with bulk loaders is best served by pausing single clients that are exceeding our rate of consumption.

Bound the number of in-flight requests at the coordinator
---------------------------------------------------------

        Key: CASSANDRA-9318
        URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
    Project: Cassandra
 Issue Type: Improvement
   Reporter: Ariel Weisberg
   Assignee: Ariel Weisberg
    Fix For: 2.2.x

It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and, if it reaches a high watermark, disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues.
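The high/low watermark scheme described in the ticket can be sketched roughly as follows. This is an illustrative, hypothetical Java sketch, not Cassandra's actual implementation; the class and method names (`InflightLimiter`, `onRequest`, `onComplete`) are invented, and a real server would toggle the selector's read interest rather than just a flag:

```java
// Hypothetical sketch of watermark-based backpressure: track outstanding
// request bytes; stop reading from clients above the high watermark and
// resume once completions bring us back under the low watermark.
import java.util.concurrent.atomic.AtomicLong;

public class InflightLimiter
{
    private final long highWatermark;
    private final long lowWatermark;
    private final AtomicLong outstandingBytes = new AtomicLong();
    private volatile boolean readsPaused = false;

    public InflightLimiter(long highWatermark, long lowWatermark)
    {
        assert lowWatermark < highWatermark;
        this.highWatermark = highWatermark;
        this.lowWatermark = lowWatermark;
    }

    /** Called when a request is read off a client connection. */
    public void onRequest(long bytes)
    {
        if (outstandingBytes.addAndGet(bytes) >= highWatermark)
            readsPaused = true; // a real server would clear OP_READ interest here
    }

    /** Called when a replica acknowledges (or we shed) a request. */
    public void onComplete(long bytes)
    {
        if (outstandingBytes.addAndGet(-bytes) <= lowWatermark)
            readsPaused = false; // re-register OP_READ here
    }

    public boolean readsPaused()
    {
        return readsPaused;
    }

    public static void main(String[] args)
    {
        InflightLimiter limiter = new InflightLimiter(100, 50);
        limiter.onRequest(60);
        System.out.println(limiter.readsPaused()); // false: 60 < 100
        limiter.onRequest(50);
        System.out.println(limiter.readsPaused()); // true: 110 >= 100
        limiter.onComplete(70);
        System.out.println(limiter.readsPaused()); // false: 40 <= 50
    }
}
```

The gap between the two watermarks provides hysteresis, so reads are not toggled on and off for every request near the limit.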
[jira] [Commented] (CASSANDRA-8099) Refactor and modernize the storage engine
[ https://issues.apache.org/jira/browse/CASSANDRA-8099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604676#comment-14604676 ]

Sylvain Lebresne commented on CASSANDRA-8099:
---------------------------------------------

I've started it and plan on focusing on it more exclusively this week. I'll add that I'm quite keen on finishing giving it this first shot myself.

Refactor and modernize the storage engine
-----------------------------------------

         Key: CASSANDRA-8099
         URL: https://issues.apache.org/jira/browse/CASSANDRA-8099
     Project: Cassandra
  Issue Type: Improvement
    Reporter: Sylvain Lebresne
    Assignee: Sylvain Lebresne
     Fix For: 3.0 beta 1
 Attachments: 8099-nit

The current storage engine (which for this ticket I'll loosely define as the code implementing the read/write path) is suffering from old age. One of the main problems is that the only structure it deals with is the cell, which completely ignores the higher-level CQL structure that groups cells into (CQL) rows. This leads to many inefficiencies, like the fact that during a read we have to group cells multiple times (to count on the replica, then to count on the coordinator, then to produce the CQL result set) because we forget about the grouping right away each time (so lots of useless cell-name comparisons in particular). But beyond inefficiencies, having to manually recreate the CQL structure every time we need it for something is hindering new features and makes the code more complex than it should be.

Said storage engine also has tons of technical debt. To pick an example, the fact that during range queries we update {{SliceQueryFilter.count}} is pretty hacky and error prone. Or the overly complex lengths {{AbstractQueryPager}} has to go to simply to remove the last query result.

So I want to bite the bullet and modernize this storage engine. I propose to do 2 main things:
# Make the storage engine more aware of the CQL structure. In practice, instead of having partitions be a simple iterable map of cells, they should be an iterable list of rows (each being itself composed of per-column cells, though obviously not exactly the same kind of cell we have today).
# Make the engine more iterative. What I mean here is that in the read path we end up reading all cells into memory (we put them in a ColumnFamily object), but there is really no reason to. If instead we were working with iterators all the way through, we could get to a point where we're basically transferring data from disk to the network, and we should be able to reduce GC substantially.

Please note that such a refactor should provide some performance improvements right off the bat, but that's not its primary goal. Its primary goal is to simplify the storage engine and add abstractions that are better suited to further optimizations.
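The "more iterative" point above can be illustrated with a toy sketch. This is not Cassandra's API; the `Row` type, `readFromDisk`, and `limit` helpers are invented purely to show rows flowing lazily from a source to a sink without being materialized into a ColumnFamily-style container:

```java
// Illustrative sketch: stream rows one at a time through composed iterators
// instead of buffering the whole partition in memory before responding.
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class IterativeReadPath
{
    record Row(long clustering, String value) {}

    /** Source iterator: pretend each call to next() reads one row from disk. */
    static Iterator<Row> readFromDisk(List<Row> onDisk)
    {
        return onDisk.iterator();
    }

    /** Transform lazily: apply a row limit without buffering anything. */
    static Iterator<Row> limit(Iterator<Row> in, int n)
    {
        return new Iterator<>()
        {
            int remaining = n;
            public boolean hasNext() { return remaining > 0 && in.hasNext(); }
            public Row next() { remaining--; return in.next(); }
        };
    }

    public static void main(String[] args)
    {
        List<Row> sstable = Arrays.asList(new Row(1, "a"), new Row(2, "b"), new Row(3, "c"));
        // Rows flow one at a time from "disk" to the "network"; nothing is
        // accumulated along the way, so the garbage produced per row is small
        // and short-lived.
        List<String> sent = new ArrayList<>();
        Iterator<Row> it = limit(readFromDisk(sstable), 2);
        while (it.hasNext())
            sent.add(it.next().value());
        System.out.println(sent); // [a, b]
    }
}
```

Because `limit` wraps the disk iterator rather than consuming it, the third row is never read at all, which is the kind of work avoidance the ticket is after.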
[jira] [Created] (CASSANDRA-9669) If sstable flushes complete out of order, on restart we can fail to replay necessary commit log records
Benedict created CASSANDRA-9669:
-----------------------------------

            Summary: If sstable flushes complete out of order, on restart we can fail to replay necessary commit log records
                Key: CASSANDRA-9669
                URL: https://issues.apache.org/jira/browse/CASSANDRA-9669
            Project: Cassandra
         Issue Type: Bug
         Components: Core
           Reporter: Benedict
           Priority: Critical

While {{postFlushExecutor}} ensures it never expires CL entries out of order, on restart we simply take the maximum replay position of any sstable on disk, and ignore anything prior. It is quite possible for there to be two flushes triggered for a given table, and for the second to finish first by virtue of containing a much smaller quantity of live data (or perhaps the disk is just under less pressure). If we crash before the first sstable has been written, then on restart the data it would have represented will disappear, since we will not replay the CL records.

This looks to be a bug present since time immemorial, and also seems pretty serious.
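The failure mode described above can be demonstrated with a toy model. The types and numbers here are invented for illustration (the real logic lives around commit log replay and flush bookkeeping); the point is only that replaying from the maximum replay position found on disk silently drops the records covered by the unfinished flush:

```java
// Toy model of the bug: flush A covers commit log positions [0, 5) but is
// still in progress; flush B covers [5, 10), is smaller, and completes
// first, so the only sstable on disk records replay position 10. We crash
// before A's sstable is written, then replay from the max position on disk.
import java.util.ArrayList;
import java.util.List;

public class OutOfOrderFlush
{
    public static void main(String[] args)
    {
        // Commit log records, identified by replay positions 0..9.
        List<Integer> commitLog = new ArrayList<>();
        for (int pos = 0; pos < 10; pos++)
            commitLog.add(pos);

        // Only flush B's sstable made it to disk; it records position 10.
        int maxReplayPositionOnDisk = 10;

        // On restart, replay skips everything before the max position found
        // on disk...
        List<Integer> replayed = new ArrayList<>();
        for (int pos : commitLog)
            if (pos >= maxReplayPositionOnDisk)
                replayed.add(pos);

        // ...so flush A's data (positions 0-4) is neither on disk nor
        // replayed: it is simply gone.
        System.out.println("replayed: " + replayed);
    }
}
```

Tracking the minimum replay position across all incomplete flushes (rather than the maximum across completed ones) would replay positions 0-4 in this model, at the cost of some duplicate replay of flush B's data.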
[jira] [Comment Edited] (CASSANDRA-8099) Refactor and modernize the storage engine
[ https://issues.apache.org/jira/browse/CASSANDRA-8099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604381#comment-14604381 ]

Benedict edited comment on CASSANDRA-8099 at 6/28/15 10:46 AM:
---------------------------------------------------------------

[~slebresne]: what's the state of play with the refactor work? Is it being done in the near future? Trying to figure out if/when I should start making pull requests for the new memtable hierarchy. (If it isn't in progress, I'll see about starting the refactor myself and having you vet it instead.)

was (Author: benedict):
[~slebresne]: what's the state of play with the refactor work? Is it being done in the near future? Trying to figure out if/when I should start making pull requests for the new memtable hierarchy.
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604649#comment-14604649 ] Jonathan Ellis edited comment on CASSANDRA-9318 at 6/28/15 12:08 PM: - Here's where I've ended up: # Continuing to accept writes faster than a coordinator can deliver them to replicas is bad. Even perfect load shedding is worse from a client perspective than throttling, since if we load shed and time out the client needs to try to guess the right rate to retry at. # For the same reason, accepting a write but then refusing it with UnavailableException is worse than waiting to accept the write until we have capacity for it. # It's more important to throttle writes because while we can get in trouble with large reads too (a small request turns into a big reply), in practice reads are naturally throttled because a client needs to wait for the read before taking action on it. With writes on the other hand a new user's first inclination is to see how fast s/he can bulk load stuff. In practice, I see load shedding and throttling as complementary. Replicas can continue to rely on load shedding. Perhaps we can attempt distributed back pressure later (if every replica is overloaded, we should again throttle clients) but for now let's narrow our scope to throttling clients to the capacity of a coordinator to send out. *I propose we define a limit on the amount of memory MessagingService can consume and pause reading additional requests whenever that limit is hit.* Note that: # If MS's load is distributed evenly across all destinations then this is trivially the right thing to do. # If MS's load is caused by a single replica falling over or unable to keep up, this is still the right thing to do because the alternative is worse. 
MS will load shed timed out requests, but if clients are sending more requests to a single replica than we can shed (i.e. if rate * timeout > capacity) then we still need to throttle or we will exhaust the heap and fall over. (The hint-based UnavailableException tries to help with scenario 2, and I will open a ticket to test how well that actually works. But the hint threshold cannot help with scenario 1 at all and that is the hole this ticket needs to plug.)
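The heap-exhaustion arithmetic in the second scenario above can be sanity-checked with a back-of-envelope sketch. Everything here is illustrative (the rates, sizes, and names are assumptions, not from the ticket); it just shows why shedding alone cannot cap memory once rate * timeout exceeds shed capacity:

```java
// Illustrative only: estimates the bytes a coordinator must hold on-heap for
// requests aimed at one stuck replica before the timeout lets it shed them.
class BackpressureMath {
    // bytes pinned in MessagingService = arrival rate * timeout window * request size
    static long inFlightBytes(long requestsPerSec, long timeoutMillis, long avgRequestBytes) {
        return requestsPerSec * timeoutMillis / 1000 * avgRequestBytes;
    }

    public static void main(String[] args) {
        // e.g. 50k writes/s of ~1 KB each against a 10 s write timeout:
        // roughly half a gigabyte pinned before anything times out, which is why
        // shedding alone cannot bound memory in this scenario.
        System.out.println(inFlightBytes(50_000, 10_000, 1_024) + " bytes");
    }
}
```

With a lower request rate or a shorter timeout the pinned memory shrinks proportionally, which is exactly the knob throttling turns.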
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604649#comment-14604649 ] Jonathan Ellis commented on CASSANDRA-9318: --- Here's where I've ended up: # Continuing to accept writes faster than a coordinator can deliver them to replicas is bad. Even perfect load shedding is worse from a client perspective than throttling, since if we load shed and time out the client needs to try to guess the right rate to retry at. # For the same reason, accepting a write but then refusing it with UnavailableException is worse than waiting to accept the write until we have capacity for it. # It's more important to throttle writes because while we can get in trouble with large reads too (a small request turns into a big reply), in practice reads are naturally throttled because a client needs to wait for the read before taking action on it. With writes on the other hand a new user's first inclination is to see how fast s/he can bulk load stuff. In practice, I see load shedding and throttling as complementary. Replicas can continue to rely on load shedding. Perhaps we can attempt distributed back pressure later (if every replica is overloaded, we should again throttle clients) but for now let's narrow our scope to throttling clients to the capacity of a coordinator to send out. I propose we define a limit on the amount of memory MessagingService can consume and pause reading additional requests whenever that limit is hit. Note that: # If MS's load is distributed evenly across all destinations then this is trivially the right thing to do. # If MS's load is caused by a single replica falling over or unable to keep up, this is still the right thing to do because the alternative is worse. 
MS will load shed timed out requests, but if clients are sending more requests to a single replica than we can shed (i.e. if rate * timeout > capacity) then we still need to throttle or we will exhaust the heap and fall over. (The hint-based UnavailableException tries to help with scenario 2, and I will open a ticket to test how well that actually works. But the hint threshold cannot help with scenario 1 at all and that is the hole this ticket needs to plug.) Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
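The proposal in the issue description — track outstanding bytes, stop reading client connections above a high watermark, resume below a low watermark — can be sketched roughly as follows. This is a hypothetical illustration, not Cassandra's implementation: the class and method names are invented, and real code would toggle read interest on the client channel rather than flip a flag:

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of the proposed coordinator throttle: a shared counter of
// in-flight request bytes gates whether client connections are read from.
class InFlightLimiter {
    private final long highWatermark;
    private final long lowWatermark;
    private final AtomicLong outstanding = new AtomicLong();
    private volatile boolean paused;

    InFlightLimiter(long highWatermark, long lowWatermark) {
        this.highWatermark = highWatermark;
        this.lowWatermark = lowWatermark;
    }

    // Called when a request is read off a client connection.
    void onRequestAccepted(long bytes) {
        if (outstanding.addAndGet(bytes) >= highWatermark)
            paused = true;   // real code: disable read on client channels
    }

    // Called when the coordinator delivers, times out, or sheds a request.
    void onRequestCompleted(long bytes) {
        if (outstanding.addAndGet(-bytes) <= lowWatermark)
            paused = false;  // real code: re-enable read
    }

    boolean acceptingReads() { return !paused; }
}
```

The gap between the two watermarks gives hysteresis: reads resume only after enough in-flight work has drained, so the gate does not flap when load hovers at the threshold.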
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604656#comment-14604656 ] Benedict commented on CASSANDRA-9318: - bq. If MS's load is caused by a single replica falling over or unable to keep up, this is still the right thing to do... or we will exhaust the heap and fall over. How does load shedding (or immediately hinting) not prevent this scenario? The proposal you're making appreciably harms our availability guarantees. Load shedding and/or hinting does not, and it fulfils this most important criterion. If we pause accepting requests _from a single client_ once that client is using in excess of a lower watermark (based on some fair share of available memory in the MS), and _only that client_ is affected, I think that is an acceptably constrained loss of availability. Enforcing it globally seems to me to harm our most central USP far too significantly.
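The per-client alternative described above — pause only the client exceeding its fair share of MessagingService memory, leaving other clients untouched — might look something like this sketch. The fair-share policy (an even split of a fixed budget) and all names are assumptions for illustration, not from the ticket:

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical per-client variant: a client over its share is refused (i.e. its
// connection would stop being read), while other clients are unaffected.
class PerClientLimiter {
    private final long perClientLimit;
    private final ConcurrentHashMap<String, AtomicLong> inFlight = new ConcurrentHashMap<>();

    PerClientLimiter(long totalBudget, int expectedClients) {
        this.perClientLimit = totalBudget / expectedClients;  // naive even split
    }

    // Returns false when this client alone should be paused.
    boolean tryAcquire(String clientId, long bytes) {
        AtomicLong used = inFlight.computeIfAbsent(clientId, k -> new AtomicLong());
        if (used.addAndGet(bytes) > perClientLimit) {
            used.addAndGet(-bytes);  // roll back; only this client is throttled
            return false;
        }
        return true;
    }

    void release(String clientId, long bytes) {
        inFlight.get(clientId).addAndGet(-bytes);
    }
}
```

A production version would need a smarter share policy (clients come and go), but the key property is visible: one greedy client hitting its limit never changes the answer for any other client.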
[jira] [Reopened] (CASSANDRA-9528) Improve log output from unit tests
[ https://issues.apache.org/jira/browse/CASSANDRA-9528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Stupp reopened CASSANDRA-9528: - Reopening just to ensure that the proper fix to get the complete artifacts doesn't get lost. Improve log output from unit tests -- Key: CASSANDRA-9528 URL: https://issues.apache.org/jira/browse/CASSANDRA-9528 Project: Cassandra Issue Type: Test Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 3.0 beta 1 * Single log output file per suite * stdout/stderr to the same log file with proper interleaving * Don't interleave interactive output from unit tests run concurrently to the console. Print everything about the test once the test has completed. * Fetch and compress log files as part of artifacts collected by cassci
[4/4] cassandra git commit: Merge branch 'cassandra-2.2' into trunk
Merge branch 'cassandra-2.2' into trunk Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/6739434c Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/6739434c Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/6739434c Branch: refs/heads/trunk Commit: 6739434c6ffa7a501989fb5f787d408e51d0a135 Parents: 9e28938 14d7a63 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:37:00 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:37:00 2015 +0200 -- CHANGES.txt | 3 +++ .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java| 2 +- .../cassandra/service/AbstractReadExecutor.java | 12 .../apache/cassandra/service/ReadCallback.java | 20 ++-- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 37 insertions(+), 4 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/6739434c/CHANGES.txt --
[3/3] cassandra git commit: Merge branch 'cassandra-2.1' into cassandra-2.2
Merge branch 'cassandra-2.1' into cassandra-2.2 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/14d7a63b Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/14d7a63b Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/14d7a63b Branch: refs/heads/cassandra-2.2 Commit: 14d7a63b8a29b15831d035182d12cfacc7518687 Parents: 2a4ab87 8a56868 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:36:54 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:36:54 2015 +0200 -- CHANGES.txt | 3 +++ .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java| 2 +- .../cassandra/service/AbstractReadExecutor.java | 12 .../apache/cassandra/service/ReadCallback.java | 20 ++-- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 37 insertions(+), 4 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/14d7a63b/CHANGES.txt -- diff --cc CHANGES.txt index 811e955,3e4fd36..8f4f752 --- a/CHANGES.txt +++ b/CHANGES.txt @@@ -1,30 -1,14 +1,33 @@@ -2.1.8 +2.2 + * Allow JMX over SSL directly from nodetool (CASSANDRA-9090) + * Update cqlsh for UDFs (CASSANDRA-7556) + * Change Windows kernel default timer resolution (CASSANDRA-9634) + * Deprected sstable2json and json2sstable (CASSANDRA-9618) + * Allow native functions in user-defined aggregates (CASSANDRA-9542) + * Don't repair system_distributed by default (CASSANDRA-9621) + * Fix mixing min, max, and count aggregates for blob type (CASSANRA-9622) + * Rename class for DATE type in Java driver (CASSANDRA-9563) + * Duplicate compilation of UDFs on coordinator (CASSANDRA-9475) + * Fix connection leak in CqlRecordWriter (CASSANDRA-9576) + * Mlockall before opening system sstables & remove boot_without_jna option (CASSANDRA-9573) + * Add functions to convert timeuuid to date or time, deprecate dateOf and unixTimestampOf (CASSANDRA-9229) + * Make sure we cancel
non-compacting sstables from LifecycleTransaction (CASSANDRA-9566) + * Fix deprecated repair JMX API (CASSANDRA-9570) + * Add logback metrics (CASSANDRA-9378) + * Update and refactor ant test/test-compression to run the tests in parallel (CASSANDRA-9583) +Merged from 2.1: * Fix IndexOutOfBoundsException when inserting tuple with too many elements using the string literal notation (CASSANDRA-9559) - * Allow JMX over SSL directly from nodetool (CASSANDRA-9090) - * Fix incorrect result for IN queries where column not found (CASSANDRA-9540) * Enable describe on indices (CASSANDRA-7814) + * Fix incorrect result for IN queries where column not found (CASSANDRA-9540) * ColumnFamilyStore.selectAndReference may block during compaction (CASSANDRA-9637) + * Fix bug in cardinality check when compacting (CASSANDRA-9580) + * Fix memory leak in Ref due to ConcurrentLinkedQueue.remove() behaviour (CASSANDRA-9549) + * Make rebuild only run one at a time (CASSANDRA-9119) Merged from 2.0 + * Improve trace messages for RR (CASSANDRA-9479) + * Fix suboptimal secondary index selection when restricted +clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) * Fix error message when attempting to create an index on a column in a COMPACT STORAGE table with clustering columns (CASSANDRA-9527) http://git-wip-us.apache.org/repos/asf/cassandra/blob/14d7a63b/src/java/org/apache/cassandra/net/MessagingService.java -- diff --cc src/java/org/apache/cassandra/net/MessagingService.java index 293a27c,1820c5c..83bc337 --- a/src/java/org/apache/cassandra/net/MessagingService.java +++ b/src/java/org/apache/cassandra/net/MessagingService.java @@@ -745,12 -728,15 +745,12 @@@ public final class MessagingService imp { TraceState state = Tracing.instance.initializeFromMessage(message); if (state != null) - state.trace("Message received from {}", message.from); + state.trace("{} message received from {}", message.verb, message.from); -Verb verb =
message.verb; -message = SinkManager.processInboundMessage(message, id); -if (message == null) -{ -incrementRejectedMessages(verb); -return; -} +// message sinks are a testing hook +for (IMessageSink ms : messageSinks) +if (!ms.allowIncomingMessage(message, id)) +return; Runnable runnable = new MessageDeliveryTask(message, id, timestamp); TracingAwareExecutorService stage =
[1/3] cassandra git commit: Improve trace messages for RR
Repository: cassandra Updated Branches: refs/heads/cassandra-2.2 2a4ab8716 - 14d7a63b8 Improve trace messages for RR patch by Robert Stupp; reviewed by Jason Brown for CASSANDRA-9479 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/353d4a05 Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/353d4a05 Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/353d4a05 Branch: refs/heads/cassandra-2.2 Commit: 353d4a052c866cb230e06e69e99d9c5c8c8d955c Parents: f2db756 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:24:34 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:24:34 2015 +0200 -- CHANGES.txt | 1 + .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java | 2 +- .../cassandra/service/AbstractReadExecutor.java | 17 + .../apache/cassandra/service/ReadCallback.java | 19 ++- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 40 insertions(+), 3 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/CHANGES.txt -- diff --git a/CHANGES.txt b/CHANGES.txt index 32f0873..6a137a3 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -1,4 +1,5 @@ 2.0.17 + * Improve trace messages for RR (CASSANDRA-9479) * Fix suboptimal secondary index selection when restricted clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/MessagingService.java -- diff --git a/src/java/org/apache/cassandra/net/MessagingService.java b/src/java/org/apache/cassandra/net/MessagingService.java index d570faf..ee6b87b 100644 --- a/src/java/org/apache/cassandra/net/MessagingService.java +++ b/src/java/org/apache/cassandra/net/MessagingService.java @@ -722,7 +722,7 @@ public final class MessagingService implements MessagingServiceMBean { TraceState 
state = Tracing.instance.initializeFromMessage(message); if (state != null) -state.trace("Message received from {}", message.from); +state.trace("{} message received from {}", message.verb, message.from); Verb verb = message.verb; message = SinkManager.processInboundMessage(message, id); http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/OutboundTcpConnection.java -- diff --git a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java index 5559df2..af61dd4 100644 --- a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java +++ b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java @@ -186,7 +186,7 @@ public class OutboundTcpConnection extends Thread { UUID sessionId = UUIDGen.getUUID(ByteBuffer.wrap(sessionBytes)); TraceState state = Tracing.instance.get(sessionId); -String message = String.format("Sending message to %s", poolReference.endPoint()); +String message = String.format("Sending %s message to %s", qm.message.verb, poolReference.endPoint()); // session may have already finished; see CASSANDRA-5668 if (state == null) { http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/service/AbstractReadExecutor.java -- diff --git a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java index 3f57e73..2f2370d 100644 --- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java +++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java @@ -43,6 +43,8 @@ import org.apache.cassandra.metrics.ReadRepairMetrics; import org.apache.cassandra.net.MessageOut; import org.apache.cassandra.net.MessagingService; import org.apache.cassandra.service.StorageProxy.LocalReadRunnable; +import org.apache.cassandra.tracing.TraceState; +import org.apache.cassandra.tracing.Tracing; import org.apache.cassandra.utils.FBUtilities; /** @@ -61,12
+63,14 @@ public abstract class AbstractReadExecutor protected final List<InetAddress> targetReplicas; protected final RowDigestResolver resolver; protected final ReadCallback<ReadResponse, Row> handler; +protected final TraceState traceState;
[2/4] cassandra git commit: Merge branch 'cassandra-2.0' into cassandra-2.1
Merge branch 'cassandra-2.0' into cassandra-2.1 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/8a56868b Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/8a56868b Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/8a56868b Branch: refs/heads/trunk Commit: 8a56868bcaa7d58c907410a1821e83ada72ee0a9 Parents: 2c58581 353d4a0 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:27:20 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:33:59 2015 +0200 -- CHANGES.txt | 1 + .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java| 2 +- .../cassandra/service/AbstractReadExecutor.java | 12 .../apache/cassandra/service/ReadCallback.java | 20 ++-- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 35 insertions(+), 4 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/CHANGES.txt -- diff --cc CHANGES.txt index 0b0cf83,6a137a3..3e4fd36 --- a/CHANGES.txt +++ b/CHANGES.txt @@@ -1,11 -1,5 +1,12 @@@ -2.0.17 +2.1.8 + * Fix IndexOutOfBoundsException when inserting tuple with too many + elements using the string literal notation (CASSANDRA-9559) + * Allow JMX over SSL directly from nodetool (CASSANDRA-9090) + * Fix incorrect result for IN queries where column not found (CASSANDRA-9540) + * Enable describe on indices (CASSANDRA-7814) + * ColumnFamilyStore.selectAndReference may block during compaction (CASSANDRA-9637) +Merged from 2.0 + * Improve trace messages for RR (CASSANDRA-9479) * Fix suboptimal secondary index selection when restricted clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/net/MessagingService.java -- 
http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java -- diff --cc src/java/org/apache/cassandra/service/AbstractReadExecutor.java index 0546e27,2f2370d..2d02e34 --- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java +++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java @@@ -77,7 -81,23 +81,8 @@@ public abstract class AbstractReadExecu protected void makeDataRequests(Iterable<InetAddress> endpoints) { -for (InetAddress endpoint : endpoints) -{ -if (isLocalRequest(endpoint)) -{ -if (traceState != null) -traceState.trace("reading data locally"); -logger.trace("reading data locally"); -StageManager.getStage(Stage.READ).execute(new LocalReadRunnable(command, handler)); -} -else -{ -if (traceState != null) -traceState.trace("reading data from {}", endpoint); -logger.trace("reading data from {}", endpoint); -MessagingService.instance().sendRR(command.createMessage(), endpoint, handler); -} -} +makeRequests(command, endpoints); ++ } protected void makeDigestRequests(Iterable<InetAddress> endpoints) @@@ -94,21 -109,18 +99,23 @@@ { if (isLocalRequest(endpoint)) { -if (traceState != null) -traceState.trace("reading digest locally"); -logger.trace("reading digest locally"); -StageManager.getStage(Stage.READ).execute(new LocalReadRunnable(digestCommand, handler)); -} -else -{ -if (traceState != null) -traceState.trace("reading digest from {}", endpoint); -logger.trace("reading digest from {}", endpoint); -MessagingService.instance().sendRR(message, endpoint, handler); +hasLocalEndpoint = true; +continue; } + ++if (traceState != null) ++traceState.trace("reading {} from {}", readCommand.isDigestQuery() ? "digest" : "data", endpoint); +logger.trace("reading {} from {}", readCommand.isDigestQuery() ? "digest" : "data", endpoint); +
[1/2] cassandra git commit: Improve trace messages for RR
Repository: cassandra Updated Branches: refs/heads/cassandra-2.1 2c5858133 - 8a56868bc Improve trace messages for RR patch by Robert Stupp; reviewed by Jason Brown for CASSANDRA-9479 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/353d4a05 Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/353d4a05 Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/353d4a05 Branch: refs/heads/cassandra-2.1 Commit: 353d4a052c866cb230e06e69e99d9c5c8c8d955c Parents: f2db756 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:24:34 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:24:34 2015 +0200 -- CHANGES.txt | 1 + .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java | 2 +- .../cassandra/service/AbstractReadExecutor.java | 17 + .../apache/cassandra/service/ReadCallback.java | 19 ++- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 40 insertions(+), 3 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/CHANGES.txt -- diff --git a/CHANGES.txt b/CHANGES.txt index 32f0873..6a137a3 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -1,4 +1,5 @@ 2.0.17 + * Improve trace messages for RR (CASSANDRA-9479) * Fix suboptimal secondary index selection when restricted clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/MessagingService.java -- diff --git a/src/java/org/apache/cassandra/net/MessagingService.java b/src/java/org/apache/cassandra/net/MessagingService.java index d570faf..ee6b87b 100644 --- a/src/java/org/apache/cassandra/net/MessagingService.java +++ b/src/java/org/apache/cassandra/net/MessagingService.java @@ -722,7 +722,7 @@ public final class MessagingService implements MessagingServiceMBean { TraceState 
state = Tracing.instance.initializeFromMessage(message); if (state != null) -state.trace(Message received from {}, message.from); +state.trace({} message received from {}, message.verb, message.from); Verb verb = message.verb; message = SinkManager.processInboundMessage(message, id); http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/OutboundTcpConnection.java -- diff --git a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java index 5559df2..af61dd4 100644 --- a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java +++ b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java @@ -186,7 +186,7 @@ public class OutboundTcpConnection extends Thread { UUID sessionId = UUIDGen.getUUID(ByteBuffer.wrap(sessionBytes)); TraceState state = Tracing.instance.get(sessionId); -String message = String.format(Sending message to %s, poolReference.endPoint()); +String message = String.format(Sending %s message to %s, qm.message.verb, poolReference.endPoint()); // session may have already finished; see CASSANDRA-5668 if (state == null) { http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/service/AbstractReadExecutor.java -- diff --git a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java index 3f57e73..2f2370d 100644 --- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java +++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java @@ -43,6 +43,8 @@ import org.apache.cassandra.metrics.ReadRepairMetrics; import org.apache.cassandra.net.MessageOut; import org.apache.cassandra.net.MessagingService; import org.apache.cassandra.service.StorageProxy.LocalReadRunnable; +import org.apache.cassandra.tracing.TraceState; +import org.apache.cassandra.tracing.Tracing; import org.apache.cassandra.utils.FBUtilities; /** @@ -61,12 
+63,14 @@ public abstract class AbstractReadExecutor protected final ListInetAddress targetReplicas; protected final RowDigestResolver resolver; protected final ReadCallbackReadResponse, Row handler; +protected final TraceState traceState;
[1/4] cassandra git commit: Improve trace messages for RR
Repository: cassandra Updated Branches: refs/heads/trunk 9e2893853 - 6739434c6 Improve trace messages for RR patch by Robert Stupp; reviewed by Jason Brown for CASSANDRA-9479 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/353d4a05 Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/353d4a05 Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/353d4a05 Branch: refs/heads/trunk Commit: 353d4a052c866cb230e06e69e99d9c5c8c8d955c Parents: f2db756 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:24:34 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:24:34 2015 +0200 -- CHANGES.txt | 1 + .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java | 2 +- .../cassandra/service/AbstractReadExecutor.java | 17 + .../apache/cassandra/service/ReadCallback.java | 19 ++- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 40 insertions(+), 3 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/CHANGES.txt -- diff --git a/CHANGES.txt b/CHANGES.txt index 32f0873..6a137a3 100644 --- a/CHANGES.txt +++ b/CHANGES.txt @@ -1,4 +1,5 @@ 2.0.17 + * Improve trace messages for RR (CASSANDRA-9479) * Fix suboptimal secondary index selection when restricted clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/MessagingService.java -- diff --git a/src/java/org/apache/cassandra/net/MessagingService.java b/src/java/org/apache/cassandra/net/MessagingService.java index d570faf..ee6b87b 100644 --- a/src/java/org/apache/cassandra/net/MessagingService.java +++ b/src/java/org/apache/cassandra/net/MessagingService.java @@ -722,7 +722,7 @@ public final class MessagingService implements MessagingServiceMBean { TraceState state = 
Tracing.instance.initializeFromMessage(message); if (state != null) -state.trace(Message received from {}, message.from); +state.trace({} message received from {}, message.verb, message.from); Verb verb = message.verb; message = SinkManager.processInboundMessage(message, id); http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/OutboundTcpConnection.java -- diff --git a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java index 5559df2..af61dd4 100644 --- a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java +++ b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java @@ -186,7 +186,7 @@ public class OutboundTcpConnection extends Thread { UUID sessionId = UUIDGen.getUUID(ByteBuffer.wrap(sessionBytes)); TraceState state = Tracing.instance.get(sessionId); -String message = String.format(Sending message to %s, poolReference.endPoint()); +String message = String.format(Sending %s message to %s, qm.message.verb, poolReference.endPoint()); // session may have already finished; see CASSANDRA-5668 if (state == null) { http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/service/AbstractReadExecutor.java -- diff --git a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java index 3f57e73..2f2370d 100644 --- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java +++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java @@ -43,6 +43,8 @@ import org.apache.cassandra.metrics.ReadRepairMetrics; import org.apache.cassandra.net.MessageOut; import org.apache.cassandra.net.MessagingService; import org.apache.cassandra.service.StorageProxy.LocalReadRunnable; +import org.apache.cassandra.tracing.TraceState; +import org.apache.cassandra.tracing.Tracing; import org.apache.cassandra.utils.FBUtilities; /** @@ -61,12 +63,14 @@ 
public abstract class AbstractReadExecutor protected final ListInetAddress targetReplicas; protected final RowDigestResolver resolver; protected final ReadCallbackReadResponse, Row handler; +protected final TraceState traceState;
[2/3] cassandra git commit: Merge branch 'cassandra-2.0' into cassandra-2.1
Merge branch 'cassandra-2.0' into cassandra-2.1 Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/8a56868b Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/8a56868b Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/8a56868b Branch: refs/heads/cassandra-2.2 Commit: 8a56868bcaa7d58c907410a1821e83ada72ee0a9 Parents: 2c58581 353d4a0 Author: Robert Stupp sn...@snazy.de Authored: Sun Jun 28 10:27:20 2015 +0200 Committer: Robert Stupp sn...@snazy.de Committed: Sun Jun 28 10:33:59 2015 +0200 -- CHANGES.txt | 1 + .../apache/cassandra/net/MessagingService.java | 2 +- .../cassandra/net/OutboundTcpConnection.java| 2 +- .../cassandra/service/AbstractReadExecutor.java | 12 .../apache/cassandra/service/ReadCallback.java | 20 ++-- .../cassandra/service/RowDataResolver.java | 2 ++ 6 files changed, 35 insertions(+), 4 deletions(-) -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/CHANGES.txt -- diff --cc CHANGES.txt index 0b0cf83,6a137a3..3e4fd36 --- a/CHANGES.txt +++ b/CHANGES.txt @@@ -1,11 -1,5 +1,12 @@@ -2.0.17 +2.1.8 + * Fix IndexOutOfBoundsException when inserting tuple with too many + elements using the string literal notation (CASSANDRA-9559) + * Allow JMX over SSL directly from nodetool (CASSANDRA-9090) + * Fix incorrect result for IN queries where column not found (CASSANDRA-9540) + * Enable describe on indices (CASSANDRA-7814) + * ColumnFamilyStore.selectAndReference may block during compaction (CASSANDRA-9637) +Merged from 2.0 + * Improve trace messages for RR (CASSANDRA-9479) * Fix suboptimal secondary index selection when restricted clustering column is also indexed (CASSANDRA-9631) * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385) http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/net/MessagingService.java -- 
http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java -- http://git-wip-us.apache.org/repos/asf/cassandra/blob/8a56868b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java -- diff --cc src/java/org/apache/cassandra/service/AbstractReadExecutor.java index 0546e27,2f2370d..2d02e34 --- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java +++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java @@@ -77,7 -81,23 +81,8 @@@ public abstract class AbstractReadExecu protected void makeDataRequests(IterableInetAddress endpoints) { -for (InetAddress endpoint : endpoints) -{ -if (isLocalRequest(endpoint)) -{ -if (traceState != null) -traceState.trace(reading data locally); -logger.trace(reading data locally); -StageManager.getStage(Stage.READ).execute(new LocalReadRunnable(command, handler)); -} -else -{ -if (traceState != null) -traceState.trace(reading data from {}, endpoint); -logger.trace(reading data from {}, endpoint); -MessagingService.instance().sendRR(command.createMessage(), endpoint, handler); -} -} +makeRequests(command, endpoints); ++ } protected void makeDigestRequests(IterableInetAddress endpoints) @@@ -94,21 -109,18 +99,23 @@@ { if (isLocalRequest(endpoint)) { -if (traceState != null) -traceState.trace(reading digest locally); -logger.trace(reading digest locally); -StageManager.getStage(Stage.READ).execute(new LocalReadRunnable(digestCommand, handler)); -} -else -{ -if (traceState != null) -traceState.trace(reading digest from {}, endpoint); -logger.trace(reading digest from {}, endpoint); -MessagingService.instance().sendRR(message, endpoint, handler); +hasLocalEndpoint = true; +continue; } + ++if (traceState != null) ++traceState.trace(reading {} from {}, readCommand.isDigestQuery() ? digest : data, endpoint); +logger.trace(reading {} from {}, readCommand.isDigestQuery() ? digest : data, endpoint); +
[2/2] cassandra git commit: Merge branch 'cassandra-2.0' into cassandra-2.1
[3/4] cassandra git commit: Merge branch 'cassandra-2.1' into cassandra-2.2
Merge branch 'cassandra-2.1' into cassandra-2.2

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/14d7a63b
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/14d7a63b
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/14d7a63b
Branch: refs/heads/trunk
Commit: 14d7a63b8a29b15831d035182d12cfacc7518687
Parents: 2a4ab87 8a56868
Author: Robert Stupp sn...@snazy.de
Authored: Sun Jun 28 10:36:54 2015 +0200
Committer: Robert Stupp sn...@snazy.de
Committed: Sun Jun 28 10:36:54 2015 +0200
--
 CHANGES.txt                                     |  3 +++
 .../apache/cassandra/net/MessagingService.java  |  2 +-
 .../cassandra/net/OutboundTcpConnection.java    |  2 +-
 .../cassandra/service/AbstractReadExecutor.java | 12
 .../apache/cassandra/service/ReadCallback.java  | 20 ++--
 .../cassandra/service/RowDataResolver.java      |  2 ++
 6 files changed, 37 insertions(+), 4 deletions(-)
--
http://git-wip-us.apache.org/repos/asf/cassandra/blob/14d7a63b/CHANGES.txt
--
diff --cc CHANGES.txt
index 811e955,3e4fd36..8f4f752
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -1,30 -1,14 +1,33 @@@
-2.1.8
+2.2
+ * Allow JMX over SSL directly from nodetool (CASSANDRA-9090)
+ * Update cqlsh for UDFs (CASSANDRA-7556)
+ * Change Windows kernel default timer resolution (CASSANDRA-9634)
+ * Deprecated sstable2json and json2sstable (CASSANDRA-9618)
+ * Allow native functions in user-defined aggregates (CASSANDRA-9542)
+ * Don't repair system_distributed by default (CASSANDRA-9621)
+ * Fix mixing min, max, and count aggregates for blob type (CASSANDRA-9622)
+ * Rename class for DATE type in Java driver (CASSANDRA-9563)
+ * Duplicate compilation of UDFs on coordinator (CASSANDRA-9475)
+ * Fix connection leak in CqlRecordWriter (CASSANDRA-9576)
+ * Mlockall before opening system sstables & remove boot_without_jna option (CASSANDRA-9573)
+ * Add functions to convert timeuuid to date or time, deprecate dateOf and unixTimestampOf (CASSANDRA-9229)
+ * Make sure we cancel non-compacting sstables from LifecycleTransaction (CASSANDRA-9566)
+ * Fix deprecated repair JMX API (CASSANDRA-9570)
+ * Add logback metrics (CASSANDRA-9378)
+ * Update and refactor ant test/test-compression to run the tests in parallel (CASSANDRA-9583)
+Merged from 2.1:
  * Fix IndexOutOfBoundsException when inserting tuple with too many
    elements using the string literal notation (CASSANDRA-9559)
- * Allow JMX over SSL directly from nodetool (CASSANDRA-9090)
- * Fix incorrect result for IN queries where column not found (CASSANDRA-9540)
  * Enable describe on indices (CASSANDRA-7814)
+ * Fix incorrect result for IN queries where column not found (CASSANDRA-9540)
  * ColumnFamilyStore.selectAndReference may block during compaction (CASSANDRA-9637)
+ * Fix bug in cardinality check when compacting (CASSANDRA-9580)
+ * Fix memory leak in Ref due to ConcurrentLinkedQueue.remove() behaviour (CASSANDRA-9549)
+ * Make rebuild only run one at a time (CASSANDRA-9119)
 Merged from 2.0
+ * Improve trace messages for RR (CASSANDRA-9479)
+ * Fix suboptimal secondary index selection when restricted
+   clustering column is also indexed (CASSANDRA-9631)
  * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385)
  * Fix error message when attempting to create an index on a column
    in a COMPACT STORAGE table with clustering columns (CASSANDRA-9527)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/14d7a63b/src/java/org/apache/cassandra/net/MessagingService.java
--
diff --cc src/java/org/apache/cassandra/net/MessagingService.java
index 293a27c,1820c5c..83bc337
--- a/src/java/org/apache/cassandra/net/MessagingService.java
+++ b/src/java/org/apache/cassandra/net/MessagingService.java
@@@ -745,12 -728,15 +745,12 @@@ public final class MessagingService imp
     {
         TraceState state = Tracing.instance.initializeFromMessage(message);
         if (state != null)
-            state.trace("Message received from {}", message.from);
+            state.trace("{} message received from {}", message.verb, message.from);

-        Verb verb = message.verb;
-        message = SinkManager.processInboundMessage(message, id);
-        if (message == null)
-        {
-            incrementRejectedMessages(verb);
-            return;
-        }
+        // message sinks are a testing hook
+        for (IMessageSink ms : messageSinks)
+            if (!ms.allowIncomingMessage(message, id))
+                return;

         Runnable runnable = new MessageDeliveryTask(message, id, timestamp);
         TracingAwareExecutorService stage =
cassandra git commit: Improve trace messages for RR
Repository: cassandra
Updated Branches: refs/heads/cassandra-2.0 f2db756ab -> 353d4a052

Improve trace messages for RR

patch by Robert Stupp; reviewed by Jason Brown for CASSANDRA-9479

Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/353d4a05
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/353d4a05
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/353d4a05
Branch: refs/heads/cassandra-2.0
Commit: 353d4a052c866cb230e06e69e99d9c5c8c8d955c
Parents: f2db756
Author: Robert Stupp sn...@snazy.de
Authored: Sun Jun 28 10:24:34 2015 +0200
Committer: Robert Stupp sn...@snazy.de
Committed: Sun Jun 28 10:24:34 2015 +0200
--
 CHANGES.txt                                     |  1 +
 .../apache/cassandra/net/MessagingService.java  |  2 +-
 .../cassandra/net/OutboundTcpConnection.java    |  2 +-
 .../cassandra/service/AbstractReadExecutor.java | 17 +
 .../apache/cassandra/service/ReadCallback.java  | 19 ++-
 .../cassandra/service/RowDataResolver.java      |  2 ++
 6 files changed, 40 insertions(+), 3 deletions(-)
--
http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index 32f0873..6a137a3 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 2.0.17
+ * Improve trace messages for RR (CASSANDRA-9479)
  * Fix suboptimal secondary index selection when restricted clustering
    column is also indexed (CASSANDRA-9631)
  * (cqlsh) Add min_threshold to DTCS option autocomplete (CASSANDRA-9385)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/MessagingService.java
--
diff --git a/src/java/org/apache/cassandra/net/MessagingService.java b/src/java/org/apache/cassandra/net/MessagingService.java
index d570faf..ee6b87b 100644
--- a/src/java/org/apache/cassandra/net/MessagingService.java
+++ b/src/java/org/apache/cassandra/net/MessagingService.java
@@ -722,7 +722,7 @@ public final class MessagingService implements MessagingServiceMBean
     {
         TraceState state = Tracing.instance.initializeFromMessage(message);
         if (state != null)
-            state.trace("Message received from {}", message.from);
+            state.trace("{} message received from {}", message.verb, message.from);

         Verb verb = message.verb;
         message = SinkManager.processInboundMessage(message, id);

http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/net/OutboundTcpConnection.java
--
diff --git a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java
index 5559df2..af61dd4 100644
--- a/src/java/org/apache/cassandra/net/OutboundTcpConnection.java
+++ b/src/java/org/apache/cassandra/net/OutboundTcpConnection.java
@@ -186,7 +186,7 @@ public class OutboundTcpConnection extends Thread
     {
         UUID sessionId = UUIDGen.getUUID(ByteBuffer.wrap(sessionBytes));
         TraceState state = Tracing.instance.get(sessionId);
-        String message = String.format("Sending message to %s", poolReference.endPoint());
+        String message = String.format("Sending %s message to %s", qm.message.verb, poolReference.endPoint());
         // session may have already finished; see CASSANDRA-5668
         if (state == null)
         {

http://git-wip-us.apache.org/repos/asf/cassandra/blob/353d4a05/src/java/org/apache/cassandra/service/AbstractReadExecutor.java
--
diff --git a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java
index 3f57e73..2f2370d 100644
--- a/src/java/org/apache/cassandra/service/AbstractReadExecutor.java
+++ b/src/java/org/apache/cassandra/service/AbstractReadExecutor.java
@@ -43,6 +43,8 @@ import org.apache.cassandra.metrics.ReadRepairMetrics;
 import org.apache.cassandra.net.MessageOut;
 import org.apache.cassandra.net.MessagingService;
 import org.apache.cassandra.service.StorageProxy.LocalReadRunnable;
+import org.apache.cassandra.tracing.TraceState;
+import org.apache.cassandra.tracing.Tracing;
 import org.apache.cassandra.utils.FBUtilities;

 /**
@@ -61,12 +63,14 @@ public abstract class AbstractReadExecutor
     protected final List<InetAddress> targetReplicas;
     protected final RowDigestResolver resolver;
    protected final ReadCallback<ReadResponse, Row> handler;
+    protected final TraceState traceState;
[jira] [Updated] (CASSANDRA-9666) Provide an alternative to DTCS
[ https://issues.apache.org/jira/browse/CASSANDRA-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philip Thompson updated CASSANDRA-9666: --- Reviewer: Marcus Eriksson Provide an alternative to DTCS -- Key: CASSANDRA-9666 URL: https://issues.apache.org/jira/browse/CASSANDRA-9666 Project: Cassandra Issue Type: Improvement Reporter: Jeff Jirsa Assignee: Jeff Jirsa Fix For: 2.1.x, 2.2.x DTCS is great for time series data, but it comes with caveats that make it difficult to use in production (typical operator behaviors such as bootstrap, removenode, and repair have MAJOR caveats as they relate to max_sstable_age_days, and hints/read repair break the selection algorithm). I'm proposing an alternative, TimeWindowCompactionStrategy, that sacrifices the tiered nature of DTCS in order to address some of DTCS' operational shortcomings. I believe it is necessary to propose an alternative rather than simply adjusting DTCS, because it fundamentally removes the tiered nature in order to remove the parameter max_sstable_age_days - the result is very, very different, even if it is heavily inspired by DTCS. Specifically, rather than creating a number of windows of ever-increasing sizes, this strategy allows an operator to choose the window size, compact with STCS within the first window of that size, and aggressively compact down to a single sstable once that window is no longer current. The window size is a combination of unit (minutes, hours, days) and size (1, etc), such that an operator can expect all data using a block of that size to be compacted together (that is, if your unit is hours and size is 6, you will create roughly 4 sstables per day, each one containing roughly 6 hours of data). The result addresses a number of the problems with DateTieredCompactionStrategy:
- At the present time, DTCS's first window is compacted using an unusual selection criteria, which prefers files with earlier timestamps, but ignores sizes.
In TimeWindowCompactionStrategy, the first window's data will be compacted with the well tested, fast, reliable STCS. All STCS options can be passed to TimeWindowCompactionStrategy to configure the first window's compaction behavior.
- HintedHandoff may put old data in new sstables, but it will have little impact other than slightly reduced efficiency (sstables will cover a wider range, but the old timestamps will not impact sstable selection criteria during compaction)
- ReadRepair may put old data in new sstables, but it will have little impact other than slightly reduced efficiency (sstables will cover a wider range, but the old timestamps will not impact sstable selection criteria during compaction)
- Small, old sstables resulting from streams of any kind will be swiftly and aggressively compacted with the other sstables matching their similar maxTimestamp, without causing sstables in neighboring windows to grow in size.
- The configuration options are explicit and straightforward - the tuning parameters leave little room for error. The window is set in common, easily understandable terms such as "12 hours", "1 Day", "30 days". The minute/hour/day options are granular enough for users keeping data for hours, and for users keeping data for years.
- There is no explicitly configurable max sstable age, though sstables will naturally stop compacting once new data is written in that window.
- Streaming operations can create sstables with old timestamps, and they'll naturally be joined together with sstables in the same time bucket. This is true for bootstrap/repair/sstableloader/removenode.
- It remains true that if old data and new data are written into the memtable at the same time, the resulting sstables will be treated as if they were new sstables; however, that no longer negatively impacts the compaction strategy's selection criteria for older windows.
Patch provided for both 2.1 ( https://github.com/jeffjirsa/cassandra/commits/twcs-2.1 ) and 2.2 ( https://github.com/jeffjirsa/cassandra/commits/twcs ) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
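As a rough sketch of the windowing arithmetic described above - a window is a unit times a size, and every timestamp maps to the one window containing it - the bucketing could look like this. All names are invented for illustration; this is not code from the actual TWCS patch:

```java
import java.util.concurrent.TimeUnit;

// Illustrative sketch only: real TWCS groups sstables by their max
// timestamp; this shows just the unit-x-size -> window arithmetic.
public class WindowBucketSketch
{
    // Lower bound (epoch millis) of the window containing tsMillis.
    static long windowLowerBound(long tsMillis, TimeUnit unit, int size)
    {
        long windowMillis = unit.toMillis(size);
        return (tsMillis / windowMillis) * windowMillis;
    }

    public static void main(String[] args)
    {
        // With a 6-hour window, two timestamps inside the same 6-hour
        // span map to the same bucket and would be compacted together.
        long w = TimeUnit.HOURS.toMillis(6);
        System.out.println(windowLowerBound(w + 1000, TimeUnit.HOURS, 6));  // 21600000
        System.out.println(windowLowerBound(2 * w - 1, TimeUnit.HOURS, 6)); // 21600000
    }
}
```

All sstables whose max timestamp yields the same lower bound would be candidates for the same window's compaction, which is why late-arriving data from hints or streaming simply joins its original bucket.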
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604697#comment-14604697 ] Jonathan Ellis commented on CASSANDRA-9318: ---
# You can't just shed indiscriminately without breaking the contract that once you've accepted a write (and not handed back UnavailableException) then you must deliver it or hint it. You can't just drop it on the floor even to save yourself from falling over. (If you fall over then at least it's clear to the user that you weren't able to fulfill your contract. No, logging it isn't good enough.)
# Again, shedding is strictly worse from a user's point of view than not accepting a write we can't handle.
Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
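The high/low watermark mechanism the ticket description sketches (track outstanding bytes, stop reading client connections above a high watermark, resume below a low one) could look roughly like this. `InflightLimiter` and every name in it are invented for illustration, not Cassandra code:

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical sketch of coordinator-side flow control: bound the bytes
// of in-flight requests rather than rejecting or shedding them.
public class InflightLimiter
{
    private final long high, low;               // watermarks, in bytes
    private final AtomicLong inflight = new AtomicLong();
    private volatile boolean readsPaused = false;

    InflightLimiter(long high, long low)
    {
        this.high = high;
        this.low = low;
    }

    // Called when a request is read off a client connection.
    void onRequest(long bytes)
    {
        if (inflight.addAndGet(bytes) >= high)
            readsPaused = true;                 // stop reading new requests
    }

    // Called when the response is flushed (or the write is hinted).
    void onComplete(long bytes)
    {
        if (inflight.addAndGet(-bytes) <= low)
            readsPaused = false;                // resume reading
    }

    boolean readsPaused()
    {
        return readsPaused;
    }

    public static void main(String[] args)
    {
        InflightLimiter limiter = new InflightLimiter(100, 50);
        limiter.onRequest(120);
        System.out.println(limiter.readsPaused()); // true: above high watermark
        limiter.onComplete(80);
        System.out.println(limiter.readsPaused()); // false: back below low watermark
    }
}
```

In a real server the `readsPaused` flag would gate the channel's read interest (e.g. toggling auto-read on the connection) rather than being polled, so clients see back-pressure instead of errors.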
[jira] [Commented] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
[ https://issues.apache.org/jira/browse/CASSANDRA-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604711#comment-14604711 ] Yuki Morishita commented on CASSANDRA-9668: --- There should be ERROR on /172.31.13.127. Can you find it with repair {{#b1e67660-1d78-11e5-aec7-4f05493cbe02}}? RepairException when trying to run concurrent repair -pr Key: CASSANDRA-9668 URL: https://issues.apache.org/jira/browse/CASSANDRA-9668 Project: Cassandra Issue Type: Bug Components: Core Environment: Cassandra 2.1.7 Reporter: david Assignee: Yuki Morishita Priority: Critical Fix For: 2.1.x Was on 2.1.3 having very similar issues to those described in: https://issues.apache.org/jira/browse/CASSANDRA-9266 I updated to 2.1.7, more for some other fixes, but now if I try and run concurrent repairs (different boxes) consistently get: {noformat} ERROR [Thread-14156] 2015-06-28 09:33:12,616 StorageService.java:2959 - Repair session b1e67660-1d78-11e5-aec7-4f05493cbe02 for range (-4660677346721084182,-4658765298409301171] failed with error org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 java.util.concurrent.ExecutionException: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at java.util.concurrent.FutureTask.report(FutureTask.java:122) [na:1.8.0_40] at java.util.concurrent.FutureTask.get(FutureTask.java:192) [na:1.8.0_40] at org.apache.cassandra.service.StorageService$4.runMayThrow(StorageService.java:2950) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) [apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40] 
at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] Caused by: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at com.google.common.base.Throwables.propagate(Throwables.java:160) ~[guava-16.0.jar:na] at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) [apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40] at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[na:1.8.0_40] ... 1 common frames omitted Caused by: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at org.apache.cassandra.repair.RepairSession.validationComplete(RepairSession.java:166) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.service.ActiveRepairService.handleMessage(ActiveRepairService.java:406) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:134) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:62) ~[apache-cassandra-2.1.7.jar:2.1.7] ... 3 common frames omitted {noformat} The specific repair command being issued: {noformat} nodetool repair -local -pr -inc -par -- keyspace {noformat} It's a 15 box environment with a replication factor of 3. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604746#comment-14604746 ] Benedict edited comment on CASSANDRA-9318 at 6/28/15 3:48 PM: -- bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. bq. The whole point is that coordinators are falling over from OOM. This isn't just something we can wave away as negligible. I was referring here to the status quo, FTR. Also FTR, we do clearly state hints are best effort (they also aren't guaranteed to be persisted), so as far as contracts / guarantees are concerned, I don't know we make any (and I wasn't aware of this one). It would be really helpful for these (and many other) discussions if all of the assumptions, contracts and guarantees we make about correctness and delivery were made available in a single clearly spelled out document (and that, like the code style, this document is the final arbiter of what action to take). was (Author: benedict): bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. bq. The whole point is that coordinators are falling over from OOM. This isn't just something we can wave away as negligible. I was referring here to the status quo, FTR. Also FTR, we do clearly state hints are best effort (they also aren't guaranteed to be persisted), so as far as contracts / guarantees are concerned, I don't know we make any (and I wasn't aware of this one). It would be really helpful for these (and many other) discussions if all of the assumptions, contracts and guarantees we make about correctness and delivery were made available in a single clearly spelled out document. 
Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.1.x, 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604767#comment-14604767 ] Jonathan Ellis commented on CASSANDRA-9318: --- I think you're [Aleksey] confusing replica-side and coordinator-side. There are no separate write and read queues on the coordinator and there probably shouldn't be. Again, let's avoid the temptation to boil the ocean. We can make a simple improvement by bounding the total outstanding request size that we have in flight on the coordinator side. Once we exceed that global limit we stop reading additional requests. That's it. There are lots of ways we can tweak this (e.g. estimating how many rows a read is likely to return) but making that change alone will be a big improvement and is something that we can reasonably do for 2.1.x, so let's not hold it up for grand rewrites of all the things. Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.1.x, 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604734#comment-14604734 ] Benedict commented on CASSANDRA-9318: - In fact, building on CASSANDRA-6230, it might be superior to immediately write to the hints buffer (saving the serialization for passing to MS; this might also reduce serialization rather than increase), and to mark the hint delivered when we receive the callback. On flushing hints, we can ignore any that have been delivered (which we would prefer to do anyway). We ideally only flush the hints buffer after the timeout interval has elapsed, or alternatively if we run out of a generous memory allowance. With some small tweaks we would only need to keep a minimal piece of identifying information to invalidate the hint record, even after it has been written to disk. This would ensure our behaviour is identical, except with a guaranteed bound on memory utilisation (and increased capacity, since we can serialize the hints off heap, and they will occupy much less space). Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
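A toy model of the record-up-front / invalidate-on-callback idea above (write the hint before dispatch, cancel it from the response callback, persist only what was never acknowledged) might look as follows. All names are invented; this is not the actual hints or CASSANDRA-6230 code:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicLong;

// Illustrative sketch: hints are recorded eagerly and invalidated by the
// delivery callback, so flushing only persists undelivered writes.
public class HintBufferSketch
{
    private final AtomicLong ids = new AtomicLong();
    private final Map<Long, String> pending = new ConcurrentHashMap<>();

    // Record the hint before dispatching the write to the replica.
    long record(String mutation)
    {
        long id = ids.incrementAndGet();
        pending.put(id, mutation);
        return id;
    }

    // The response callback marks the hint delivered, invalidating it.
    void onAck(long id)
    {
        pending.remove(id);
    }

    // Flushing persists only hints that were never acknowledged.
    List<String> flushUndelivered()
    {
        List<String> out = new ArrayList<>(pending.values());
        pending.clear();
        return out;
    }

    public static void main(String[] args)
    {
        HintBufferSketch buf = new HintBufferSketch();
        long id = buf.record("mutation-1");
        buf.record("mutation-2");
        buf.onAck(id);                              // replica acknowledged mutation-1
        System.out.println(buf.flushUndelivered()); // [mutation-2]
    }
}
```

The memory bound in the comment above comes from flushing the buffer once a timeout or memory allowance is hit; acknowledged entries cost nothing at that point because they have already been removed.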
[jira] [Commented] (CASSANDRA-8099) Refactor and modernize the storage engine
[ https://issues.apache.org/jira/browse/CASSANDRA-8099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604748#comment-14604748 ] Benedict commented on CASSANDRA-8099: - That's all I needed to hear. Thanks! Refactor and modernize the storage engine - Key: CASSANDRA-8099 URL: https://issues.apache.org/jira/browse/CASSANDRA-8099 Project: Cassandra Issue Type: Improvement Reporter: Sylvain Lebresne Assignee: Sylvain Lebresne Fix For: 3.0 beta 1 Attachments: 8099-nit The current storage engine (which for this ticket I'll loosely define as the code implementing the read/write path) is suffering from old age. One of the main problems is that the only structure it deals with is the cell, which completely ignores the higher-level CQL structure that groups cells into (CQL) rows. This leads to many inefficiencies, like the fact that during a read we have to group cells multiple times (to count on the replica, then to count on the coordinator, then to produce the CQL resultset) because we forget about the grouping right away each time (so lots of useless cell name comparisons in particular). But beyond inefficiencies, having to manually recreate the CQL structure every time we need it for something is hindering new features and makes the code more complex than it should be. Said storage engine also has tons of technical debt. To pick an example, the fact that during range queries we update {{SliceQueryFilter.count}} is pretty hacky and error prone. Or the overly complex hoops {{AbstractQueryPager}} has to jump through simply to remove the last query result. So I want to bite the bullet and modernize this storage engine. I propose to do 2 main things: # Make the storage engine more aware of the CQL structure. In practice, instead of having partitions be a simple iterable map of cells, they should be an iterable list of rows (each being itself composed of per-column cells, though obviously not exactly the same kind of cell we have today).
# Make the engine more iterative. What I mean here is that in the read path, we end up reading all cells into memory (we put them in a ColumnFamily object), but there is really no reason to. If instead we were working with iterators all the way through, we could get to a point where we're basically transferring data from disk to the network, and we should be able to reduce GC substantially. Please note that such a refactor should provide some performance improvements right off the bat, but that's not its primary goal either. Its primary goal is to simplify the storage engine and add abstractions that are better suited to further optimizations. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
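The "iterative" point above - transforming rows lazily as they stream from disk to the network instead of materializing them in a ColumnFamily-like container - can be illustrated with a minimal lazy-transform wrapper. `LazyReadSketch` is an invented name, not an engine type:

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.function.Function;

// Illustrative sketch: each element is converted only when next() is
// called, so nothing needs to be held in memory all at once.
public class LazyReadSketch
{
    static <A, B> Iterator<B> transform(final Iterator<A> in, final Function<A, B> f)
    {
        return new Iterator<B>()
        {
            public boolean hasNext() { return in.hasNext(); }
            public B next() { return f.apply(in.next()); }
        };
    }

    public static void main(String[] args)
    {
        // Stand-in for "row read from disk" -> "row serialized to the network".
        Iterator<Integer> out = transform(Arrays.asList(1, 2, 3).iterator(), x -> x * 2);
        while (out.hasNext())
            System.out.println(out.next());
    }
}
```

Chaining such wrappers (filter, merge, count) keeps only one element live at a time, which is where the GC reduction the ticket mentions would come from.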
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604763#comment-14604763 ] Jonathan Ellis edited comment on CASSANDRA-9318 at 6/28/15 4:31 PM: Hinting is better than leaving things in an unknown state but it's not something we should opt users into if we have a better option, since it basically turns the write into CL.ANY. I think you're [Benedict] overselling how scary it is to stop reading new requests until we can free up some memory from MS. We're not dropping connections. We're just imposing some flow control. Which is something that already happens at different levels anyway. was (Author: jbellis): Hinting is better than leaving things in an unknown state but it's not something we should opt users into if we have a better option, since it basically turns the write into CL.ANY. I think you're overselling how scary it is to stop reading new requests until we can free up some memory from MS. We're not dropping connections. We're just imposing some flow control. Which is something that already happens at different levels anyway. Bound the number of in-flight requests at the coordinator - Key: CASSANDRA-9318 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318 Project: Cassandra Issue Type: Improvement Reporter: Ariel Weisberg Assignee: Ariel Weisberg Fix For: 2.1.x, 2.2.x It's possible to somewhat bound the amount of load accepted into the cluster by bounding the number of in-flight requests and request bytes. An implementation might do something like track the number of outstanding bytes and requests and if it reaches a high watermark disable read on client connections until it goes back below some low watermark. Need to make sure that disabling read on the client connection won't introduce other issues. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604771#comment-14604771 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. I think you're [Aleksey] confusing replica-side and coordinator-side. There are no separate write and read queues on the coordinator and there probably shouldn't be. I don't think I am. I also don't think that the distinction between replica-side and coordinator-side is very useful here, given the most common case of collocating the two roles (with the exception of Rick). There are no separate queues, but there should eventually be something bounding both reads and writes. And arguably the two are different enough from each other (one is request heavy, the other is response heavy) to be treated differently, and in effect as two separate queues. Otherwise, this discussion sounds like fun, but I'm going to stay out of it until I form a 2.1.x-appropriate opinion on it. Just wanted to clarify some hints-related questions and CASSANDRA-6230 (which, being a 3.0 ticket, is somewhat not immediately relevant).
[jira] [Updated] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
[ https://issues.apache.org/jira/browse/CASSANDRA-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philip Thompson updated CASSANDRA-9668: --- Assignee: Yuki Morishita RepairException when trying to run concurrent repair -pr Key: CASSANDRA-9668 URL: https://issues.apache.org/jira/browse/CASSANDRA-9668 Project: Cassandra Issue Type: Bug Components: Core Environment: Cassandra 2.1.7 Reporter: david Assignee: Yuki Morishita Priority: Critical Fix For: 2.1.x Was on 2.1.3 having very similar issues to those described in: https://issues.apache.org/jira/browse/CASSANDRA-9266 I updated to 2.1.7, more for some other fixes, but now if I try and run concurrent repairs (different boxes) consistently get: {noformat} ERROR [Thread-14156] 2015-06-28 09:33:12,616 StorageService.java:2959 - Repair session b1e67660-1d78-11e5-aec7-4f05493cbe02 for range (-4660677346721084182,-4658765298409301171] failed with error org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 java.util.concurrent.ExecutionException: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at java.util.concurrent.FutureTask.report(FutureTask.java:122) [na:1.8.0_40] at java.util.concurrent.FutureTask.get(FutureTask.java:192) [na:1.8.0_40] at org.apache.cassandra.service.StorageService$4.runMayThrow(StorageService.java:2950) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28) [apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40] at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] Caused 
by: java.lang.RuntimeException: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at com.google.common.base.Throwables.propagate(Throwables.java:160) ~[guava-16.0.jar:na] at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:32) [apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_40] at java.util.concurrent.FutureTask.run(FutureTask.java:266) [na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) ~[na:1.8.0_40] ... 1 common frames omitted Caused by: org.apache.cassandra.exceptions.RepairException: [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]] Validation failed in /172.31.13.127 at org.apache.cassandra.repair.RepairSession.validationComplete(RepairSession.java:166) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.service.ActiveRepairService.handleMessage(ActiveRepairService.java:406) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.repair.RepairMessageVerbHandler.doVerb(RepairMessageVerbHandler.java:134) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.net.MessageDeliveryTask.run(MessageDeliveryTask.java:62) ~[apache-cassandra-2.1.7.jar:2.1.7] ... 3 common frames omitted {noformat} The specific repair command being issued: {noformat} nodetool repair -local -pr -inc -par -- keyspace {noformat} It's a 15 box environment with a replication factor of 3. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-9666) Provide an alternative to DTCS
[ https://issues.apache.org/jira/browse/CASSANDRA-9666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Philip Thompson updated CASSANDRA-9666: --- Assignee: Jeff Jirsa Provide an alternative to DTCS -- Key: CASSANDRA-9666 URL: https://issues.apache.org/jira/browse/CASSANDRA-9666 Project: Cassandra Issue Type: Improvement Reporter: Jeff Jirsa Assignee: Jeff Jirsa Fix For: 2.1.x, 2.2.x DTCS is great for time series data, but it comes with caveats that make it difficult to use in production (typical operator behaviors such as bootstrap, removenode, and repair have MAJOR caveats as they relate to max_sstable_age_days, and hints/read repair break the selection algorithm). I'm proposing an alternative, TimeWindowCompactionStrategy, that sacrifices the tiered nature of DTCS in order to address some of DTCS' operational shortcomings. I believe it is necessary to propose an alternative rather than simply adjusting DTCS, because it fundamentally removes the tiered nature in order to remove the parameter max_sstable_age_days - the result is very, very different, even if it is heavily inspired by DTCS. Specifically, rather than creating a number of windows of ever increasing sizes, this strategy allows an operator to choose the window size, compact with STCS within the first window of that size, and aggressively compact down to a single sstable once that window is no longer current. The window size is a combination of unit (minutes, hours, days) and size (1, etc), such that an operator can expect all data using a block of that size to be compacted together (that is, if your unit is hours, and size is 6, you will create roughly 4 sstables per day, each one containing roughly 6 hours of data). The result addresses a number of the problems with DateTieredCompactionStrategy: - At the present time, DTCS’s first window is compacted using an unusual selection criterion, which prefers files with earlier timestamps, but ignores sizes. 
In TimeWindowCompactionStrategy, data in the first window will be compacted with the well tested, fast, reliable STCS. All STCS options can be passed to TimeWindowCompactionStrategy to configure the first window’s compaction behavior. - HintedHandoff may put old data in new sstables, but it will have little impact other than slightly reduced efficiency (sstables will cover a wider range, but the old timestamps will not impact sstable selection criteria during compaction) - ReadRepair may put old data in new sstables, but it will have little impact other than slightly reduced efficiency (sstables will cover a wider range, but the old timestamps will not impact sstable selection criteria during compaction) - Small, old sstables resulting from streams of any kind will be swiftly and aggressively compacted with the other sstables matching their similar maxTimestamp, without causing sstables in neighboring windows to grow in size. - The configuration options are explicit and straightforward - the tuning parameters leave little room for error. The window is set in common, easily understandable terms such as “12 hours”, “1 day”, “30 days”. The minute/hour/day options are granular enough for users keeping data for hours, and users keeping data for years. - There is no explicitly configurable max sstable age, though sstables will naturally stop compacting once new data is written in that window. - Streaming operations can create sstables with old timestamps, and they'll naturally be joined together with sstables in the same time bucket. This is true for bootstrap/repair/sstableloader/removenode. - It remains true that if old data and new data are written into the memtable at the same time, the resulting sstables will be treated as if they were new sstables; however, that no longer negatively impacts the compaction strategy’s selection criteria for older windows. 
Patch provided for both 2.1 ( https://github.com/jeffjirsa/cassandra/commits/twcs-2.1 ) and 2.2 ( https://github.com/jeffjirsa/cassandra/commits/twcs ) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
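A rough sketch of the windowing rule the ticket describes (illustrative only, not the actual TWCS patch): with unit = hours and size = 6, every sstable whose maxTimestamp falls in the same 6-hour block belongs to the same window, yielding roughly 4 windows per day.

```java
import java.util.concurrent.TimeUnit;

public class TimeWindowSketch {
    // Returns the lower bound (epoch millis) of the window containing tsMillis,
    // for a window of `size` units (e.g. unit=HOURS, size=6 -> 6-hour windows).
    static long windowFor(long tsMillis, TimeUnit unit, int size) {
        long windowMillis = unit.toMillis(size);
        return (tsMillis / windowMillis) * windowMillis;  // truncate to window start
    }

    public static void main(String[] args) {
        long fiveHours = TimeUnit.HOURS.toMillis(5);
        long sevenHours = TimeUnit.HOURS.toMillis(7);
        // Two sstables 5 hours apart share a 6-hour window; 7 hours apart do not.
        System.out.println(windowFor(0, TimeUnit.HOURS, 6) == windowFor(fiveHours, TimeUnit.HOURS, 6));
        System.out.println(windowFor(0, TimeUnit.HOURS, 6) == windowFor(sevenHours, TimeUnit.HOURS, 6));
    }
}
```

Because the bucket is a pure function of maxTimestamp, sstables created late by streaming or repair with old timestamps still land in the correct (already-major-compacted) window, which is the operational win over max_sstable_age_days.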
[jira] [Commented] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
[ https://issues.apache.org/jira/browse/CASSANDRA-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604718#comment-14604718 ] david commented on CASSANDRA-9668: -- Yes, here is the corresponding error: {noformat} ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 CompactionManager.java:972 - Cannot start multiple repair sessions over the same sstables ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 Validator.java:245 - Failed creating a merkle tree for [repair #b1c30fe0-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (9062648853864216757,9072201154757474095]], /172.31.46.189 (see log for details) ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 CassandraDaemon.java:223 - Exception in thread Thread[ValidationExecutor:19,1,main] java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:973) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:94) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager$9.call(CompactionManager.java:623) ~[apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] {noformat} This suggests running concurrent repairs (with -pr) is not possible. Is this true? 
[jira] [Updated] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9318: -- Fix Version/s: 2.1.x
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604735#comment-14604735 ] Jonathan Ellis edited comment on CASSANDRA-9318 at 6/28/15 4:31 PM: Let's pull optimizing hints to a separate ticket. It is complementary to "don't accept more requests than you can handle." was (Author: jbellis): Let's pull optimizing hints to a separate ticket. It is complementary to "don't accept more than you can handle."
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604740#comment-14604740 ] Benedict commented on CASSANDRA-9318: - bq. To be clear: all I'm effectively suggesting is we hint ... earlier. OK, so it looks like my understanding of hints was incorrect. However this statement is still valid, and still a better course of action. If we hint immediately, we buy ourselves breathing room _without affecting availability_. If this doesn't buy us enough breathing room, then blocking incoming writes is fine. But we should exhaust all our avenues that maintain our guarantees first, no?
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604715#comment-14604715 ] Benedict commented on CASSANDRA-9318: - bq. shedding is strictly worse If we can do so without affecting our availability guarantees, sure. bq. the contract We already drop hints on the floor if they cannot keep up. To be clear: all I'm effectively suggesting is we hint (including the hinting step that drops hints\*) earlier. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. All we do is wait until the timeout expires and then hint. This simply preempts the timer and hints immediately, possibly resulting in the message being delivered via hints when it was delivered successfully (but very late) first time around, but also possibly resulting in the hint being dropped, as it could be at either time. \* except that we can probably make this decision _less_ pessimistic than it is currently, with better visibility on overall resource utilisation.
[jira] [Comment Edited] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
[ https://issues.apache.org/jira/browse/CASSANDRA-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604718#comment-14604718 ] david edited comment on CASSANDRA-9668 at 6/28/15 3:19 PM: --- Yes, here is the corresponding error: {noformat} ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CompactionManager.java:972 - Cannot start multiple repair sessions over the same sstables ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 Validator.java:245 - Failed creating a merkle tree for [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on evosload_services_otg_scee_com_driveclub/data, (-4660677346721084182,-4658765298409301171]], /172.31.46.189 (see log for details) ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CassandraDaemon.java:223 - Exception in thread Thread[ValidationExecutor:19,1,main] java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:973) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:94) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager$9.call(CompactionManager.java:623) ~[apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] {noformat} This suggests running concurrent repairs (with -pr) is not possible. Is this true? 
was (Author: biffta): Yes, here is corrosponding error: {noformat} ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 CompactionManager.java:972 - Cannot start multiple repair sessions over the same sstables ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 Validator.java:245 - Failed creating a merkle tree for [repair #b1c30fe0-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (9062648853864216757,9072201154757474095]], /172.31.46.189 (see log for details) ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,029 CassandraDaemon.java:223 - Exception in thread Thread[ValidationExecutor:19,1,main] java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:973) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:94) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager$9.call(CompactionManager.java:623) ~[apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] {noformat} This suggest running concurrent repairs (with -pr) is not possible. Is this true? 
[jira] [Comment Edited] (CASSANDRA-9668) RepairException when trying to run concurrent repair -pr
[ https://issues.apache.org/jira/browse/CASSANDRA-9668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604718#comment-14604718 ] david edited comment on CASSANDRA-9668 at 6/28/15 3:19 PM: --- Yes, here is the corresponding error: {noformat} ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CompactionManager.java:972 - Cannot start multiple repair sessions over the same sstables ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 Validator.java:245 - Failed creating a merkle tree for [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on keyspace/data, (-4660677346721084182,-4658765298409301171]], /172.31.46.189 (see log for details) ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CassandraDaemon.java:223 - Exception in thread Thread[ValidationExecutor:19,1,main] java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:973) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:94) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager$9.call(CompactionManager.java:623) ~[apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] {noformat} This suggests running concurrent repairs (with -pr) is not possible. Is this true? 
was (Author: biffta): Yes, here is corrosponding error: {noformat} ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CompactionManager.java:972 - Cannot start multiple repair sessions over the same sstables ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 Validator.java:245 - Failed creating a merkle tree for [repair #b1e67660-1d78-11e5-aec7-4f05493cbe02 on evosload_services_otg_scee_com_driveclub/data, (-4660677346721084182,-4658765298409301171]], /172.31.46.189 (see log for details) ERROR [ValidationExecutor:19] 2015-06-28 09:33:12,261 CassandraDaemon.java:223 - Exception in thread Thread[ValidationExecutor:19,1,main] java.lang.RuntimeException: Cannot start multiple repair sessions over the same sstables at org.apache.cassandra.db.compaction.CompactionManager.doValidationCompaction(CompactionManager.java:973) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager.access$600(CompactionManager.java:94) ~[apache-cassandra-2.1.7.jar:2.1.7] at org.apache.cassandra.db.compaction.CompactionManager$9.call(CompactionManager.java:623) ~[apache-cassandra-2.1.7.jar:2.1.7] at java.util.concurrent.FutureTask.run(FutureTask.java:266) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) ~[na:1.8.0_40] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) [na:1.8.0_40] at java.lang.Thread.run(Thread.java:745) [na:1.8.0_40] {noformat} This suggest running concurrent repairs (with -pr) is not possible. Is this true? 
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604728#comment-14604728 ] Jonathan Ellis edited comment on CASSANDRA-9318 at 6/28/15 3:24 PM: bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. The whole point is that coordinators are falling over from OOM. This isn't just something we can wave away as negligible. was (Author: jbellis): bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. The whole point is that coordinators are falling over from OOM.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604726#comment-14604726 ] Jonathan Ellis commented on CASSANDRA-9318: --- We do not drop hints on the floor. We abort a write with UAE (UnavailableException) if we have too many hints in progress, but that is okay because we've told the client to expect it. Here's a quick summary: # If you send TimedOutException to a client, then you have to write a hint # If you send UAE, then you do not have to write a hint This is equivalent to # If you send the write to any replicas, you must send it to all (via hint if necessary) # If you send UAE, you must do so before sending to any replicas
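The two equivalent rule pairs above can be encoded as a toy contract check (names here are invented for the sketch, not Cassandra's): a write either reaches all replicas eventually (hinting after a timeout), or is rejected up front with UnavailableException before any replica sees it.

```java
public class HintContract {
    enum Response { TIMED_OUT_EXCEPTION, UNAVAILABLE_EXCEPTION }

    // Rules 1/2: only a timeout response obliges the coordinator to write a hint.
    static boolean mustWriteHint(Response r) {
        return r == Response.TIMED_OUT_EXCEPTION;
    }

    // Rules 3/4: UnavailableException is only legal if the write was not yet
    // sent to any replica; otherwise the coordinator is committed to delivery.
    static boolean legal(Response r, boolean sentToAnyReplica) {
        return r != Response.UNAVAILABLE_EXCEPTION || !sentToAnyReplica;
    }

    public static void main(String[] args) {
        System.out.println(mustWriteHint(Response.TIMED_OUT_EXCEPTION));
        System.out.println(legal(Response.UNAVAILABLE_EXCEPTION, true));
    }
}
```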
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604728#comment-14604728 ] Jonathan Ellis commented on CASSANDRA-9318: --- bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. The whole point is that coordinators are falling over from OOM.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604735#comment-14604735 ] Jonathan Ellis commented on CASSANDRA-9318: --- Let's pull optimizing hints out to a separate ticket. It is complementary to "don't accept more than you can handle."
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604746#comment-14604746 ] Benedict commented on CASSANDRA-9318: - bq. This in no way affects our contract or guarantees, since we don't do anything at all in the intervening period except consume memory. bq. The whole point is that coordinators are falling over from OOM. This isn't just something we can wave away as negligible. I was referring here to the status quo, FTR. Also FTR, we do clearly state that hints are best effort (they also aren't guaranteed to be persisted), so as far as contracts / guarantees are concerned, I don't know that we make any (and I wasn't aware of this one). It would be really helpful for these (and many other) discussions if all of the assumptions, contracts, and guarantees we make about correctness and delivery were made available in a single, clearly spelled out document.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604755#comment-14604755 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. We do not drop hints on the floor. We don't explicitly, but they are still best effort - before CASSANDRA-6230, and after CASSANDRA-6230, too. Before, because of their brokenness, truncating the whole hints table at the first sign of trouble, or disabling hints entirely, is the norm. Even if this hadn't been the case, the way they are persisted - TTL'd by the lowest gc_grace_seconds of all tables in the mutation - means that there are many scenarios in which we wouldn't replay what we had persisted. We also don't preserve a hint at all if a node is past the hint window, and this only affects CL.ANY. CASSANDRA-6230 codifies this fact, and that makes having a separate file per host feasible at all - without an intermediate shared log. As soon as we write to a shared {{HintsBuffer}}, we consider a hint written.
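The persistence rule mentioned above (hints TTL'd by the lowest gc_grace_seconds of all tables in the mutation) amounts to a simple minimum. The sketch below uses hypothetical names, not Cassandra's actual implementation; the rationale is that a hint kept longer than any table's tombstone grace period risks resurrecting deleted data on replay.

```java
import java.util.Arrays;

// A hint is only safe to replay while every table in the mutation still
// retains its tombstones, so its TTL is the minimum gc_grace_seconds
// across those tables. Hypothetical sketch.
class HintTtl {
    static int ttlSeconds(int[] gcGraceSecondsOfTables) {
        return Arrays.stream(gcGraceSecondsOfTables).min().orElse(0);
    }
}
```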
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604765#comment-14604765 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. On flushing hints, we can ignore any that have been delivered (which we would prefer to do anyway). We ideally only flush the hints buffer after the timeout interval has elapsed, or alternatively if we run out of a generous memory allowance. This we can/should do. bq. With some small tweaks we would only need to keep a minimal piece of identifying information to invalidate the hint record, even after it has been written to disk. I would prefer not to go after records that have already made it to disk - the former should be good enough. In general, let's keep hints chatter to the CASSANDRA-6230 ticket comments, please. As for the matter at hand - we should bound both write and read queues, but certainly not just by some fixed queue lengths. For reads we should be bounded by a fixed-size read buffer on one side, and an SLA on the other - which we could do once we have the ability to terminate queries in flight. For writes, I don't have an opinion formed yet.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604763#comment-14604763 ] Jonathan Ellis commented on CASSANDRA-9318: --- Hinting is better than leaving things in an unknown state but it's not something we should opt users into if we have a better option, since it basically turns the write into CL.ANY. I think you're overselling how scary it is to stop reading new requests until we can free up some memory from MS. We're not dropping connections. We're just imposing some flow control. Which is something that already happens at different levels anyway.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604769#comment-14604769 ] Jonathan Ellis commented on CASSANDRA-9318: --- (Re: hints still being best effort -- yes, there are reasons why hints might not be replayed, but that is not the same as not being written in the first place [when a timeout is returned], which is what I was talking about.)
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604659#comment-14604659 ] Jonathan Ellis commented on CASSANDRA-9318: --- bq. How does load shedding (or immediately hinting) not prevent this scenario? I explained that: bq. if clients are sending more requests to a single replica than we can shed (rate * timeout capacity) Suppose we can hold 1GB of messages before we start having GC trouble, and the timeout is the default 2s. If we get more than 500MB/s of requests for a dead replica, we will be in trouble since they are arriving faster than they are shed. bq. The proposal you're making appreciably harms our availability guarantees. It does not, because we only throttle when shedding is insufficient to keep us in a happy place. bq. If we pause accepting requests from a single client Then we don't solve the larger problem, which is multiple clients teaming up to overwhelm a coordinator.
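The back-of-envelope bound in that comment follows from the fact that messages are only shed once they time out, so the steady-state shed rate cannot exceed budget / timeout. A minimal sketch of the arithmetic (hypothetical helper, illustrative only):

```java
// Max sustainable arrival rate given an in-flight memory budget and a
// request timeout: anything above budget/timeout grows the backlog
// faster than timeouts can drain it. With a 1GB budget and the default
// 2s timeout, that threshold is 500MB/s, as in the comment above.
class ShedBound {
    static double maxRateBytesPerSec(long budgetBytes, double timeoutSeconds) {
        return budgetBytes / timeoutSeconds;
    }
}
```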
[jira] [Created] (CASSANDRA-9672) Provide a per-table param that would force default ttl on all updates
Aleksey Yeschenko created CASSANDRA-9672: Summary: Provide a per-table param that would force default ttl on all updates Key: CASSANDRA-9672 URL: https://issues.apache.org/jira/browse/CASSANDRA-9672 Project: Cassandra Issue Type: Improvement Reporter: Aleksey Yeschenko Priority: Minor Many users have tables that rely on TTL entirely - no deletes, and only a fixed TTL value. The way that default ttl works now, we only apply it if none is specified. We should provide an option that would *enforce* the specified TTL: not allowing ttl-less {{INSERT}} or {{UPDATE}}, not allowing a ttl that's lower or higher than the default ttl, and not allowing deletes. That option, when enabled ({{force_default_ttl}}), should allow us to drop more sstables during compaction and do so more cheaply. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
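The proposed constraint can be summarized as a small validation predicate. The option name `force_default_ttl` is from the ticket; the `TtlValidator` class and its logic below are a hypothetical illustration of the three rules (no deletes, no ttl-less writes, no ttl other than the default), not Cassandra's implementation.

```java
// Hypothetical check for the proposed force_default_ttl table option:
// a write is allowed only if it is not a delete and carries a TTL equal
// to the table's default_time_to_live.
class TtlValidator {
    static boolean allowed(Integer requestTtl, int defaultTtl, boolean isDelete) {
        if (isDelete) return false;            // deletes are not allowed
        if (requestTtl == null) return false;  // ttl-less INSERT/UPDATE not allowed
        return requestTtl.intValue() == defaultTtl; // ttl must match the default
    }
}
```

With every cell guaranteed to expire at the same fixed TTL and no tombstones from deletes, compaction can drop whole expired sstables without inspecting their contents, which is where the cheapness comes from.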
[jira] [Created] (CASSANDRA-9671) sum() and avg() functions missing for smallint and tinyint types
Aleksey Yeschenko created CASSANDRA-9671: Summary: sum() and avg() functions missing for smallint and tinyint types Key: CASSANDRA-9671 URL: https://issues.apache.org/jira/browse/CASSANDRA-9671 Project: Cassandra Issue Type: Bug Reporter: Aleksey Yeschenko Assignee: Robert Stupp Fix For: 2.2.x {{AggregateFcts}} does not define {{sum()}} and {{avg()}} aggregates for the new {{tinyint}} and {{smallint}} types. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-9672) Provide a per-table param that would force default ttl on all updates
[ https://issues.apache.org/jira/browse/CASSANDRA-9672?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Aleksey Yeschenko updated CASSANDRA-9672: - Description: Many users have tables that rely on TTL entirely - no deletes, and only a fixed TTL value. The way that default ttl works now, we only apply it if none is specified. We should provide an option that would *enforce* the specified TTL: not allowing ttl-less {{INSERT}} or {{UPDATE}}, not allowing a ttl that's lower or higher than the default ttl, and not allowing deletes. That option, when enabled ({{force_default_ttl}}), should allow us to drop more sstables during compaction and do so more cheaply. Would also allow DBAs to enforce the constraint in a guaranteed manner. was: Many users have tables that don't rely on TTL entirely - no deletes, and only fixed TTL value. The way that default ttl works now, we only apply it if none is specified. We should provide an option that would *enforce* the specified TTL. Not allowing ttl-less {{INSERT}} or {{UPDATE}}, not allowing ttl that's lower or higher than the default ttl, and not allowing deletes. That option when enabled ({{force_default_ttl}}) should allow us to drop more tables during compaction and do so cheaper. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604790#comment-14604790 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. Yes. My point is that if we start by not accepting more than we can handle coordinator-side we (a) improve things immediately by a nontrivial amount and (b) we will have more clarity on what needs to be done replica-side. Right. I'm all for flow control, in principle, and I'm not insisting on doing it comprehensively in one ticket, or even one version. Sorry if I was unclear. Not sure if it can be meaningfully done in 2.1, or whether any of the suggested options are workable - will reply later.
[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux
[ https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sanjay Patel updated CASSANDRA-9670: Environment: DataStax Community Edition on Windows 7, 64 Bit and Ubuntu was:Windows 7, 64 Bit and Ubuntu Cannot run CQL scripts on Windows AND having error Ubuntu Linux --- Key: CASSANDRA-9670 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670 Project: Cassandra Issue Type: Bug Components: Core Environment: DataStax Community Edition on Windows 7, 64 Bit and Ubuntu Reporter: Sanjay Patel Fix For: 2.1.7 Attachments: cities.cql After installation of 2.1.6 and 2.1.7 it is not possible to execute cql scripts, which were earlier executed on windows + Linux environment successfully. I have tried to install Python 2 latest version and try to execute, but having same error. Attaching cities.cql for reference. --- cqlsh source 'shoppoint_setup.cql' ; shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] message=Keyspace 'shopping' does not exist shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: ordinal not in range(128) cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not in range(128) cities.cql:14: Error starting import process: cities.cql:14:Can't pickle type 'thread.lock': it's not found as thread.lock cities.cql:14:can only join a started process cities.cql:16: Error starting import process: cities.cql:16:Can't pickle type 'thread.lock': it's not found as thread.lock cities.cql:16:can only join a started process Traceback (most recent call last): File string, line 1, in module File I:\programm\python2710\lib\multiprocessing\forking.py, line 380, in main prepare(preparation_data) File I:\programm\python2710\lib\multiprocessing\forking.py, line 489, in prepare Traceback (most recent call last): File string, line 1, in module file, path_name, etc = imp.find_module(main_name, dirs) ImportError: No module named cqlsh File 
I:\programm\python2710\lib\multiprocessing\forking.py, line 380, in main prepare(preparation_data) File I:\programm\python2710\lib\multiprocessing\forking.py, line 489, in prepare file, path_name, etc = imp.find_module(main_name, dirs) ImportError: No module named cqlsh shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: ordinal not in range(128) ipcache.cql:28:ServerError: ErrorMessage code= [Server error] message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.RuntimeException: java.io.FileNotFoundException: I:\var\lib\cassandra\data\syste m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db (The process cannot access the file because it is being used by another process) ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.RuntimeException: java.io.FileNotFoundException: I:\var\lib\cassandra\d ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db (The process cannot access the file because it is being used by another process) shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: ordinal not in range(128) - In one of Ubuntu development environment we have similar errors. - shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: ordinal not in range(128) cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not in range(128) (corresponding line) COPY cities (city,country_code,state,isactive) FROM 'testdata/india_cities.csv' ; [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: ordinal not in range(128) -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604775#comment-14604775 ] Jonathan Ellis edited comment on CASSANDRA-9318 at 6/28/15 4:52 PM: Replica and coordinator are only identical on writes when RF=1, hardly the most common case. Nor is it a good idea to try to allow extra reads when write capacity is full or vice versa. They both ultimately use the same resources (cpu, heap, disk i/o). was (Author: jbellis): Replica and coordinator are only identical on writes when RF=1. Nor is it a good idea to try to allow extra reads when write capacity is full or vice versa. They both ultimately use the same resources (cpu, heap, disk i/o).
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604775#comment-14604775 ] Jonathan Ellis commented on CASSANDRA-9318: --- Replica and coordinator are only identical on writes when RF=1. Nor is it a good idea to try to allow extra reads when write capacity is full or vice versa. They both ultimately use the same resources (cpu, heap, disk i/o).
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604778#comment-14604778 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. Nor is it a good idea to try to allow extra reads when write capacity is full or vice versa. Oh, I'm most definitely not suggesting that. I want very tight bounds on reads and writes, separately. Some extra breathing room on writes should not allow for more reads (or vice versa). I care more about tail latencies than raw throughput, and the former is more of an issue for us than the latter.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604779#comment-14604779 ] Aleksey Yeschenko commented on CASSANDRA-9318: -- bq. Replica and coordinator are only identical on writes when RF=1, hardly the most common case. In non-toy clusters, with tons of clients, even at RF=1 they aren't, no disagreement here. What I meant was that with the roles collocated on the same machines (one request's replica is another node's coordinator), it's insufficient to only handle protecting 'the coordinator' - an OOMd node is an OOMd node. Eventually it has to be full-node.
[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator
[ https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604783#comment-14604783 ] Jonathan Ellis commented on CASSANDRA-9318: --- Yes. My point is that if we start by not accepting more than we can handle coordinator-side we (a) improve things immediately by a nontrivial amount and (b) we will have more clarity on what needs to be done replica-side. (I don't think it's clear at all whether we will need better load shedding, actual cross-network backpressure, or something else.)
[jira] [Commented] (CASSANDRA-8479) Timeout Exception on Node Failure in Remote Data Center
[ https://issues.apache.org/jira/browse/CASSANDRA-8479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14604787#comment-14604787 ] Anuj Wadehra commented on CASSANDRA-8479: - @Sam I think it's an issue. As you mentioned in your comment: "At least one of the digests doesn't match, triggering a blocking full read against all the replicas that were sent digest requests - which includes the down node in the remote DC." The blocking full read must be triggered ONLY against the replicas that were sent digest requests. Why were digest requests sent to the remote DC when the read CL was LOCAL_QUORUM? This seems to be a major problem. Should I reopen the JIRA? Timeout Exception on Node Failure in Remote Data Center --- Key: CASSANDRA-8479 URL: https://issues.apache.org/jira/browse/CASSANDRA-8479 Project: Cassandra Issue Type: Bug Components: API, Core, Tools Environment: Unix, Cassandra 2.0.11 Reporter: Amit Singh Chowdhery Assignee: Sam Tunnicliffe Priority: Minor Attachments: TRACE_LOGS.zip Issue faced: We have a geo-redundant setup with 2 data centers having 3 nodes each. When we bring a single Cassandra node down in DC2 by kill -9 Cassandra-pid, reads fail on DC1 with TimedOutException for a brief amount of time (~15-20 sec). Reference: a ticket has already been opened/resolved; the link is provided below: https://issues.apache.org/jira/browse/CASSANDRA-8352 Activity done as per the resolution provided: upgraded to Cassandra 2.0.11. We have two 3-node clusters in two different DCs, and if one or more of the nodes go down in one data center, ~5-10% traffic failure is observed on the other. CL: LOCAL_QUORUM RF=3 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux
Sanjay Patel created CASSANDRA-9670:
---
Summary: Cannot run CQL scripts on Windows AND having error Ubuntu Linux
Key: CASSANDRA-9670
URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
Project: Cassandra
Issue Type: Bug
Components: Core
Environment: Windows 7, 64 Bit and Ubuntu
Reporter: Sanjay Patel
Fix For: 2.1.7
Attachments: cities.cql

After installing 2.1.6 and 2.1.7 it is no longer possible to execute CQL scripts that were earlier executed successfully on Windows and Linux environments. I have tried installing the latest Python 2 version and executing again, but I get the same errors. Attaching cities.cql for reference.

{noformat}
cqlsh> source 'shoppoint_setup.cql' ;
shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] message=Keyspace 'shopping' does not exist
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not in range(128)
cities.cql:14: Error starting import process:
cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:14:can only join a started process
cities.cql:16: Error starting import process:
cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:16:can only join a started process
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: ordinal not in range(128)
ipcache.cql:28:ServerError: ErrorMessage code= [Server error] message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.RuntimeException: java.io.FileNotFoundException: I:\var\lib\cassandra\data\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db (The process cannot access the file because it is being used by another process)
ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: java.lang.RuntimeException: java.io.FileNotFoundException: I:\var\lib\cassandra\data\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db (The process cannot access the file because it is being used by another process)
shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: ordinal not in range(128)
{noformat}

In one of our Ubuntu development environments we have similar errors:

{noformat}
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not in range(128)
{noformat}

(corresponding line)

{noformat}
COPY cities (city,country_code,state,isactive) FROM 'testdata/india_cities.csv' ;
{noformat}

[19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: ordinal not in range(128)

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
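The {{'ascii' codec can't decode byte 0xc3}} errors above are what Python 2 reports when UTF-8 encoded bytes (0xc3 opens a two-byte UTF-8 sequence, typical of accented characters in city names) are decoded with the default ASCII codec. A minimal sketch of the failure mode, not cqlsh's actual code (the sample city name is hypothetical):

```python
# A name containing U+00E9 (e with acute accent), encoded as UTF-8.
# The encoded bytes start the accented character with 0xc3.
raw = "Pondich\u00e9ry".encode("utf-8")
assert 0xC3 in raw

try:
    raw.decode("ascii")  # what the reported failure effectively did
except UnicodeDecodeError as exc:
    print(exc)           # 'ascii' codec can't decode byte 0xc3 ...

# Decoding with the file's real encoding succeeds.
print(raw.decode("utf-8"))
```

The fix on the cqlsh side is to read script and CSV input with an explicit (or detected) encoding rather than the ASCII default.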
[jira] [Commented] (CASSANDRA-9649) Paxos ballot in StorageProxy could clash
[ https://issues.apache.org/jira/browse/CASSANDRA-9649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605000#comment-14605000 ] Stefania commented on CASSANDRA-9649:
-

I've run the tests one more time. Here is my analysis: all tests are either flaky, having failed at least once in the past on the unpatched branches, or quite simply fail all the time. For some we definitely have JIRAs, but I have not mentioned them here as I don't have the numbers at hand.

* 2.0 testall:
** no failures
* 2.0 dtest:
** jmxmetrics_test.TestJMXMetrics.begin_test: failed on unpatched 2.0, build #80
** counter_tests.TestCounters.upgrade_test: failed on unpatched 2.0, build #78
** compaction_test.TestCompaction_with_DateTieredCompactionStrategy.sstable_deletion_test: failed on unpatched 2.0, build #80
** compaction_test.TestCompaction_with_SizeTieredCompactionStrategy.sstable_deletion_test: failed on unpatched 2.0, build #80
** thrift_hsha_test.ThriftHSHATest.test_closing_connections: failed on unpatched 2.0, build #76
** repair_test.TestRepair.dc_repair_test: failed on unpatched 2.0, build #69
** paging_test.TestPagingWithDeletions.test_single_partition_deletions: failed on unpatched 2.0, build #72
** paging_test.TestPagingWithDeletions.test_single_row_deletions: failed on unpatched 2.0, build #62
* 2.1 testall:
** org.apache.cassandra.concurrent.LongSharedExecutorPoolTest.testPromptnessOfExecution: failed on unpatched 2.1, build #106
* 2.1 dtest:
** jmxmetrics_test.TestJMXMetrics.begin_test: failed on unpatched 2.1, build #154
** repair_test.TestRepair.dc_repair_test: failed on unpatched 2.1, build #154
** compaction_test.TestCompaction_with_DateTieredCompactionStrategy.sstable_deletion_test: failed on unpatched 2.1, build #154
** compaction_test.TestCompaction_with_SizeTieredCompactionStrategy.sstable_deletion_test: failed on unpatched 2.1, build #154
** thrift_hsha_test.ThriftHSHATest.test_closing_connections: failed on unpatched 2.1, build #102
** upgrade_supercolumns_test.TestSCUpgrade.upgrade_with_counters_test: failed on unpatched 2.1, build #122
** cql_tests.MiscellaneousCQLTester.prepared_statement_invalidation_test: failed on unpatched 2.1, build #149
* 2.2 testall:
** org.apache.cassandra.db.lifecycle.ViewTest.testSSTablesInBounds: failed on unpatched 2.2, build #88
** org.apache.cassandra.db.lifecycle.ViewTest.testSSTablesInBounds-compression: failed on unpatched 2.2, build #88
* 2.2 dtest:
** consistency_test.TestConsistency.short_read_test: failed on unpatched 2.2, build #72
** jmx_test.TestJMX.cfhistograms_test: failed on unpatched 2.2, build #115
** upgrade_internal_auth_test.TestAuthUpgrade.upgrade_to_30_test: failed on unpatched 2.2, build #120
** compaction_test.TestCompaction_with_DateTieredCompactionStrategy.sstable_deletion_test: failed on unpatched 2.2, build #120
** compaction_test.TestCompaction_with_LeveledCompactionStrategy.sstable_deletion_test: failed on unpatched 2.2, build #120

Paxos ballot in StorageProxy could clash
Key: CASSANDRA-9649
URL: https://issues.apache.org/jira/browse/CASSANDRA-9649
Project: Cassandra
Issue Type: Bug
Reporter: Stefania
Assignee: Stefania
Priority: Minor

This code in {{StorageProxy.beginAndRepairPaxos()}} takes a timestamp in microseconds but divides it by 1000 before adding one. So if the summary is null, ballotMillis would be the same for up to 1000 possible state timestamp values:

{code}
long currentTime = (state.getTimestamp() / 1000) + 1;
long ballotMillis = summary == null
                  ? currentTime
                  : Math.max(currentTime, 1 + UUIDGen.unixTimestamp(summary.mostRecentInProgressCommit.ballot));
UUID ballot = UUIDGen.getTimeUUID(ballotMillis);
{code}

{{state.getTimestamp()}} returns the time in microseconds and ensures that one microsecond is added to any previously used timestamp if the client sends the same or an older timestamp.
Initially I used this code in {{ModificationStatement.casInternal()}}, introduced by CASSANDRA-9160 to support cas unit tests, but occasionally these tests were failing. It was only when I ensured uniqueness of the ballot that the tests started to pass reliably. I wonder if we could ever have the same issue in StorageProxy? cc [~jbellis] and [~slebresne] for CASSANDRA-7801 -- This message was sent by Atlassian JIRA (v6.3.4#6332)
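The truncation described above is easy to see in isolation: any 1000 consecutive microsecond timestamps collapse to the same millisecond ballot value once integer-divided by 1000. A small Python sketch mirroring the Java expression (the starting timestamp is arbitrary, chosen for illustration):

```python
def ballot_millis(state_timestamp_micros: int) -> int:
    # Mirrors (state.getTimestamp() / 1000) + 1 from beginAndRepairPaxos()
    # for the summary == null case.
    return (state_timestamp_micros // 1000) + 1

# 1000 distinct microsecond timestamps, all inside the same millisecond...
start = 1_420_070_400_000_000
ballots = {ballot_millis(t) for t in range(start, start + 1000)}

# ...yield a single ballotMillis value, so every caller would mint a
# time-UUID from the same millisecond and risk a clash.
print(len(ballots))  # 1
```

This is exactly why ensuring ballot uniqueness (rather than relying on the timestamp alone) made the CAS unit tests pass reliably.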
[jira] [Commented] (CASSANDRA-9673) Improve batchlog write path
[ https://issues.apache.org/jira/browse/CASSANDRA-9673?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605020#comment-14605020 ] Aleksey Yeschenko commented on CASSANDRA-9673:
--

Marking as 3.X because it's not blocking 3.0, but it would be nice to have it in before the RC happens.

Improve batchlog write path
---
Key: CASSANDRA-9673
URL: https://issues.apache.org/jira/browse/CASSANDRA-9673
Project: Cassandra
Issue Type: Improvement
Reporter: Aleksey Yeschenko
Fix For: 3.x

Currently we allocate an on-heap {{ByteBuffer}} to serialize the batched mutations into, before sending it to a distant node, generating unnecessary garbage (potentially a lot of it). With materialized views using the batchlog, it would be nice to optimise the write path:
- introduce a new verb ({{Batch}})
- introduce a new message ({{BatchMessage}}) that would encapsulate the mutations, expiration, and creation time (similar to {{HintMessage}} in CASSANDRA-6230)
- have MS serialize it directly instead of relying on an intermediate buffer

To avoid merely shifting the temp buffer to the receiving side(s) we should change the structure of the batchlog table to use a list or a map of individual mutations.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
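The garbage-saving idea — serializing mutations straight to the outgoing stream instead of through a temporary buffer — can be sketched as follows. This is a Python stand-in with an invented length-prefixed framing, not Cassandra's wire format or API:

```python
import io
import struct

def serialize_via_buffer(mutations):
    """Current shape: build an intermediate buffer, then ship it whole."""
    buf = io.BytesIO()
    buf.write(struct.pack(">i", len(mutations)))
    for m in mutations:
        buf.write(struct.pack(">i", len(m)))
        buf.write(m)
    return buf.getvalue()  # this extra copy is the garbage being complained about

def serialize_direct(mutations, out):
    """Proposed shape: write each mutation straight to the output stream."""
    out.write(struct.pack(">i", len(mutations)))
    for m in mutations:
        out.write(struct.pack(">i", len(m)))
        out.write(m)

# Both produce identical bytes on the wire; the second never materialises
# the whole frame in memory at once.
```

The same reasoning applies on the receiving side, which is why the description also proposes restructuring the batchlog table into individual mutations rather than one serialized blob.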
[jira] [Created] (CASSANDRA-9673) Improve batchlog write path
Aleksey Yeschenko created CASSANDRA-9673:
---
Summary: Improve batchlog write path
Key: CASSANDRA-9673
URL: https://issues.apache.org/jira/browse/CASSANDRA-9673
Project: Cassandra
Issue Type: Improvement
Reporter: Aleksey Yeschenko
Fix For: 3.x

Currently we allocate an on-heap {{ByteBuffer}} to serialize the batched mutations into, before sending it to a distant node, generating unnecessary garbage (potentially a lot of it). With materialized views using the batchlog, it would be nice to optimise the write path:
- introduce a new verb ({{Batch}})
- introduce a new message ({{BatchMessage}}) that would encapsulate the mutations, expiration, and creation time (similar to {{HintMessage}} in CASSANDRA-6230)
- have MS serialize it directly instead of relying on an intermediate buffer

To avoid merely shifting the temp buffer to the receiving side(s) we should change the structure of the batchlog table to use a list or a map of individual mutations.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9448) Metrics should use up to date nomenclature
[ https://issues.apache.org/jira/browse/CASSANDRA-9448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605014#comment-14605014 ] Stefania commented on CASSANDRA-9448:
-

bq. First of all, isn't it better to keep old names as deprecated until we stop supporting 2.2?

Agreed, but how? I could not find a way to add a deprecated tag or alias name to metrics or JMX beans. Do you know a way to do this, or did you simply mean to keep duplicated metrics and just deprecate them in the documentation?

bq. In StorageService, there are some operations referring ColumnFamily. We should rename these also.

Could you be more specific and list the methods that feed the metrics? I merely renamed the methods feeding the metrics, or their parents, for consistency, as otherwise it would be very confusing. I did not mean to change every single instance of 'ColumnFamily' in the code. That would be best done in a dedicated ticket after CASSANDRA-8099 has been merged.

Metrics should use up to date nomenclature
--
Key: CASSANDRA-9448
URL: https://issues.apache.org/jira/browse/CASSANDRA-9448
Project: Cassandra
Issue Type: Improvement
Components: Tools
Reporter: Sam Tunnicliffe
Assignee: Stefania
Labels: docs-impacting, jmx
Fix For: 3.0 beta 1

There are a number of exposed metrics that are currently named using the old nomenclature of columnfamily and rows (meaning partitions). It would be good to audit all metrics and update any names to match what they actually represent; we should probably do that in a single sweep to avoid a confusing mixture of old and new terminology. As we'd need to do this in a major release, I've initially set the fixver for 3.0 beta1.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9232) timestamp is considered as a reserved keyword in cqlsh completion
[ https://issues.apache.org/jira/browse/CASSANDRA-9232?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605053#comment-14605053 ] Stefania commented on CASSANDRA-9232:
-

Unfortunately the CQL keywords exported by the new version of the driver are too many. We now have 123 keywords, but Python regular expressions only support a maximum of 100 named groups:

{code}
Traceback (most recent call last):
  File "/home/stefania/git/cstar/cassandra/bin/cqlsh", line 2459, in <module>
    main(*read_options(sys.argv[1:], os.environ))
  File "/home/stefania/git/cstar/cassandra/bin/cqlsh", line 2451, in main
    shell.cmdloop()
  File "/home/stefania/git/cstar/cassandra/bin/cqlsh", line 942, in cmdloop
    if self.onecmd(self.statement.getvalue()):
  File "/home/stefania/git/cstar/cassandra/bin/cqlsh", line 959, in onecmd
    statements, in_batch = cqlruleset.cql_split_statements(statementtext)
  File "/home/stefania/git/cstar/cassandra/bin/../pylib/cqlshlib/cqlhandling.py", line 143, in cql_split_statements
    tokens = self.lex(text)
  File "/home/stefania/git/cstar/cassandra/bin/../pylib/cqlshlib/pylexotron.py", line 447, in lex
    self.scanner = self.make_lexer()
  File "/home/stefania/git/cstar/cassandra/bin/../pylib/cqlshlib/pylexotron.py", line 443, in make_lexer
    return SaferScanner(regexes, re.I | re.S).scan
  File "/home/stefania/git/cstar/cassandra/bin/../pylib/cqlshlib/saferscanner.py", line 37, in __init__
    self.scanner = re.sre_compile.compile(p)
  File "/usr/lib/python2.7/sre_compile.py", line 509, in compile
    "sorry, but this version only supports 100 named groups"
AssertionError: sorry, but this version only supports 100 named groups
{code}

There is a third-party module that might do the job, [regex|https://pypi.python.org/pypi/regex], but it would require changing saferscanner.py and adding one more dependency. Can we do without named groups and add a separate map instead?
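The "separate map" idea floated above can be sketched like this: instead of one named group per keyword (which is what hits the 100-named-group cap), use a single generic word group and classify the matched text through a dict lookup. This is an illustrative toy lexer, not cqlsh's pylexotron; the keyword set and token classes are invented for the example:

```python
import re

# Hypothetical keyword set; the real driver exports 123 of these.
KEYWORDS = {"select", "insert", "timestamp", "ascii", "bigint"}

# Three fixed groups, regardless of how many keywords exist.
TOKEN_RE = re.compile(
    r"(?P<word>[A-Za-z_][A-Za-z0-9_]*)|(?P<num>\d+)|(?P<ws>\s+)",
    re.I | re.S,
)

def lex(text):
    tokens = []
    for m in TOKEN_RE.finditer(text):
        kind, value = m.lastgroup, m.group()
        if kind == "word":
            # The per-keyword decision moves out of the regex into a map lookup,
            # so the group count no longer grows with the keyword list.
            kind = "keyword" if value.lower() in KEYWORDS else "identifier"
        if kind != "ws":
            tokens.append((kind, value))
    return tokens

print(lex("SELECT 42 ts"))
```

The pattern size stays constant as keywords are added, sidestepping the `sre_compile` limit without a new dependency.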
timestamp is considered as a reserved keyword in cqlsh completion
---
Key: CASSANDRA-9232
URL: https://issues.apache.org/jira/browse/CASSANDRA-9232
Project: Cassandra
Issue Type: Bug
Reporter: Michaël Figuière
Assignee: Stefania
Priority: Trivial
Labels: cqlsh
Fix For: 3.x, 2.1.x

cqlsh seems to treat timestamp as a reserved keyword when used as an identifier:

{code}
cqlsh:ks1> create table t1 (int int primary key, ascii ascii, bigint bigint, blob blob, boolean boolean, date date, decimal decimal, double double, float float, inet inet, text text, time time, timestamp timestamp, timeuuid timeuuid, uuid uuid, varchar varchar, varint varint);
{code}

Leads to the following completion when building an {{INSERT}} statement:

{code}
cqlsh:ks1> insert into t1 (int,
"timestamp"  ascii  bigint  blob  boolean  date  decimal  double  float  inet  text  time  timeuuid  uuid  varchar  varint
{code}

timestamp is a keyword but not a reserved one and should therefore not be proposed as a quoted string. It looks like this error happens only for timestamp. Not a big deal of course, but it might be worth reviewing the keywords treated as reserved in cqlsh, especially with the many changes introduced in 3.0.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-9619) Read performance regression in tables with many columns on trunk and 2.2 vs. 2.1
[ https://issues.apache.org/jira/browse/CASSANDRA-9619?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9619:
--
Assignee: Benedict
Fix Version/s: (was: 2.2.0 rc2) 2.2.x, 2.1.x

I think we should figure out why preemptive open causes a slowdown for this workload, but I don't think it should block 2.2.0.

Read performance regression in tables with many columns on trunk and 2.2 vs. 2.1
Key: CASSANDRA-9619
URL: https://issues.apache.org/jira/browse/CASSANDRA-9619
Project: Cassandra
Issue Type: Bug
Reporter: Jim Witschey
Assignee: Benedict
Labels: perfomance
Fix For: 2.1.x, 2.2.x

There seems to be a regression in reads in 2.2 and trunk, as compared to 2.1 and 2.0. I found it running cstar_perf jobs with 50-column tables. 2.2 may be worse than trunk, though my results on that aren't consistent. The relevant cstar_perf jobs are here:

http://cstar.datastax.com/tests/id/273e2ea8-0fc8-11e5-816c-42010af0688f
http://cstar.datastax.com/tests/id/3a8002d6-1480-11e5-97ff-42010af0688f
http://cstar.datastax.com/tests/id/40ff2766-1248-11e5-bac8-42010af0688f

The sequence of commands for these jobs is:

{code}
stress write n=6500 -rate threads=300 -col n=FIXED\(50\)
stress read n=6500 -rate threads=300
stress read n=6500 -rate threads=300
{code}

Have a look at the operations per second going from [the first read operation|http://cstar.datastax.com/graph?stats=273e2ea8-0fc8-11e5-816c-42010af0688f&metric=op_rate&operation=2_read&smoothing=1&show_aggregates=true&xmin=0&xmax=729.08&ymin=0&ymax=174379.7] to [the second read operation|http://cstar.datastax.com/graph?stats=273e2ea8-0fc8-11e5-816c-42010af0688f&metric=op_rate&operation=2_read&smoothing=1&show_aggregates=true&xmin=0&xmax=729.08&ymin=0&ymax=174379.7]. They've fallen from ~135K to ~100K comparing trunk to 2.1 and 2.0. It's slightly worse for 2.2, and 2.2 operations per second fall continuously from the first to the second read operation.

There's a corresponding increase in read latency -- it's noticeable on trunk and pretty bad on 2.2. Again, the latency gets higher and higher on 2.2 as the read operations progress (see the graphs [here|http://cstar.datastax.com/graph?stats=273e2ea8-0fc8-11e5-816c-42010af0688f&metric=95th_latency&operation=2_read&smoothing=1&show_aggregates=true&xmin=0&xmax=729.08&ymin=0&ymax=17.27] and [here|http://cstar.datastax.com/graph?stats=273e2ea8-0fc8-11e5-816c-42010af0688f&metric=95th_latency&operation=3_read&smoothing=1&show_aggregates=true&xmin=0&xmax=928.62&ymin=0&ymax=14.52]).

I see a similar regression in a [more recent test|http://cstar.datastax.com/graph?stats=40ff2766-1248-11e5-bac8-42010af0688f&metric=op_rate&operation=2_read&smoothing=1&show_aggregates=true&xmin=0&xmax=752.62&ymin=0&ymax=171799.1], though in this one trunk performed worse than 2.2. This run also didn't display the increasing latency in 2.2.

This regression may show for smaller numbers of columns, but not as prominently, as shown [in the results of this test with the stress default of 5 columns|http://cstar.datastax.com/graph?stats=227cb89e-0fc8-11e5-9f14-42010af0688f&metric=99.9th_latency&operation=3_read&smoothing=1&show_aggregates=true&xmin=0&xmax=498.19&ymin=0&ymax=334.29]. There's an increase in latency variability on trunk and 2.2, but I don't see a regression in summary statistics.

My measurements aren't confounded by [the recent regression in cassandra-stress|https://issues.apache.org/jira/browse/CASSANDRA-9558]; cstar_perf uses the same stress program (from trunk) on all versions on the cluster.

I'm currently working to:
- reproduce with a smaller workload so this is easier to bisect and debug.
- get results with larger numbers of columns, since we've seen the regression on 50 columns but not the stress default of 5.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9064) [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe table statement
[ https://issues.apache.org/jira/browse/CASSANDRA-9064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605076#comment-14605076 ] Jonathan Ellis commented on CASSANDRA-9064:
---

Did the python driver ship as planned? Can you integrate it, Benjamin?

[LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe table statement
Key: CASSANDRA-9064
URL: https://issues.apache.org/jira/browse/CASSANDRA-9064
Project: Cassandra
Issue Type: Bug
Components: Core
Environment: cassandra 2.1.3 on mac os x
Reporter: Sujeet Gholap
Assignee: Adam Holmberg
Labels: cqlsh
Fix For: 2.2.0 rc2, 2.1.8

Here's how to reproduce:

1) Create a table with LeveledCompactionStrategy:
{code}
CREATE KEYSPACE foo WITH REPLICATION = {'class': 'SimpleStrategy', 'replication_factor' : 3};
CREATE TABLE foo.bar (
    spam text PRIMARY KEY
) WITH compaction = {'class': 'LeveledCompactionStrategy'};
{code}

2) Describe the table and save the output:
{code}
cqlsh -e "describe table foo.bar"
{code}
Output should be something like:
{code}
CREATE TABLE foo.bar (
    spam text PRIMARY KEY
) WITH bloom_filter_fp_chance = 0.1
    AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
    AND comment = ''
    AND compaction = {'min_threshold': '4', 'class': 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy', 'max_threshold': '32'}
    AND compression = {'sstable_compression': 'org.apache.cassandra.io.compress.LZ4Compressor'}
    AND dclocal_read_repair_chance = 0.1
    AND default_time_to_live = 0
    AND gc_grace_seconds = 864000
    AND max_index_interval = 2048
    AND memtable_flush_period_in_ms = 0
    AND min_index_interval = 128
    AND read_repair_chance = 0.0
    AND speculative_retry = '99.0PERCENTILE';
{code}

3) Save the output to repro.cql

4) Drop the table foo.bar:
{code}
cqlsh -e "drop table foo.bar"
{code}

5) Run the create table statement we saved:
{code}
cqlsh -f repro.cql
{code}

6) Expected: normal execution without an error

7) Reality:
{code}
ConfigurationException: ErrorMessage code=2300 [Query invalid because of configuration issue] message=Properties specified [min_threshold, max_threshold] are not understood by LeveledCompactionStrategy
{code}

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
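The round-trip failure comes down to DESCRIBE emitting compaction options that LeveledCompactionStrategy refuses to accept back. Until the server-side fix, a script replaying DESCRIBE output could strip those options first. A hedged sketch working on an already-parsed options dict (the parsing itself is out of scope; the rejected option names are taken from the error message above):

```python
# Options LCS rejects when echoed back from DESCRIBE, per the reported error.
UNSUPPORTED_FOR_LCS = {"min_threshold", "max_threshold"}

def clean_compaction_options(options):
    """Drop options LeveledCompactionStrategy refuses; pass others through."""
    if options.get("class", "").endswith("LeveledCompactionStrategy"):
        return {k: v for k, v in options.items() if k not in UNSUPPORTED_FOR_LCS}
    return dict(options)

described = {
    "min_threshold": "4",
    "class": "org.apache.cassandra.db.compaction.LeveledCompactionStrategy",
    "max_threshold": "32",
}
print(clean_compaction_options(described))
# Only the 'class' entry survives, and the CREATE TABLE replays cleanly.
```

For size-tiered tables the thresholds are legitimate and are left untouched.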
[jira] [Updated] (CASSANDRA-9636) Duplicate columns in selection causes AssertionError
[ https://issues.apache.org/jira/browse/CASSANDRA-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9636:
--
Fix Version/s: (was: 2.2.0 rc2)

Duplicate columns in selection causes AssertionError
Key: CASSANDRA-9636
URL: https://issues.apache.org/jira/browse/CASSANDRA-9636
Project: Cassandra
Issue Type: Bug
Reporter: Sam Tunnicliffe
Assignee: Sam Tunnicliffe
Fix For: 2.1.x, 2.0.x

Prior to CASSANDRA-9532, unaliased duplicate fields in a selection would be silently ignored. Now they trigger a server-side exception and an unfriendly error response, which we should clean up. Duplicate columns *with* aliases are not affected.

{code}
CREATE KEYSPACE ks WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};
CREATE TABLE ks.t2 (k int PRIMARY KEY, v int);
INSERT INTO ks.t2 (k, v) VALUES (0, 0);
SELECT k, v FROM ks.t2;
SELECT k, v, v AS other_v FROM ks.t2;
SELECT k, v, v FROM ks.t2;
{code}

The final statement results in this error response; server-side stacktrace:

{code}
ServerError: ErrorMessage code= [Server error] message=java.lang.AssertionError
ERROR 13:01:30 Unexpected exception during request; channel = [id: 0x44d22e61, /127.0.0.1:39463 => /127.0.0.1:9042]
java.lang.AssertionError: null
    at org.apache.cassandra.cql3.ResultSet.addRow(ResultSet.java:63) ~[main/:na]
    at org.apache.cassandra.cql3.statements.Selection$ResultSetBuilder.build(Selection.java:355) ~[main/:na]
    at org.apache.cassandra.cql3.statements.SelectStatement.process(SelectStatement.java:1226) ~[main/:na]
    at org.apache.cassandra.cql3.statements.SelectStatement.processResults(SelectStatement.java:299) ~[main/:na]
    at org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:238) ~[main/:na]
    at org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:67) ~[main/:na]
    at org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238) ~[main/:na]
    at org.apache.cassandra.cql3.QueryProcessor.process(QueryProcessor.java:260) ~[main/:na]
    at org.apache.cassandra.transport.messages.QueryMessage.execute(QueryMessage.java:119) ~[main/:na]
    at org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439) [main/:na]
    at org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335) [main/:na]
    at io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105) [netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333) [netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32) [netty-all-4.0.23.Final.jar:4.0.23.Final]
    at io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324) [netty-all-4.0.23.Final.jar:4.0.23.Final]
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) [na:1.8.0_45]
    at org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164) [main/:na]
    at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) [main/:na]
    at java.lang.Thread.run(Thread.java:745) [na:1.8.0_45]
{code}

This issue also presents on the head of the 2.2 branch and on 2.0.16. However, the prior behaviour is different on both of those branches. In the 2.0 line prior to CASSANDRA-9532, duplicate columns would actually be included in the results, as opposed to being silently dropped as per 2.1.x. In 2.2, the assertion error seen above predates CASSANDRA-9532 and is also triggered for both aliased and unaliased duplicate columns.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
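The pre-9532 behaviour the ticket describes (silently ignoring unaliased duplicates, while aliased duplicates stay distinct) amounts to a simple deduplication rule over the selection. A sketch of that rule, with (column, alias) tuples standing in for the parsed selectors — not Cassandra's `Selection` code:

```python
def dedupe_selection(selectors):
    """Keep the first occurrence of each unaliased column; keep every
    aliased selector, since the alias makes it a distinct result column."""
    seen = set()
    result = []
    for column, alias in selectors:
        if alias is None:
            if column in seen:
                continue  # unaliased duplicate: silently dropped (2.1.x behaviour)
            seen.add(column)
        result.append((column, alias))
    return result

# SELECT k, v, v            -> the duplicate v is dropped
print(dedupe_selection([("k", None), ("v", None), ("v", None)]))
# SELECT k, v, v AS other_v -> all three kept
print(dedupe_selection([("k", None), ("v", None), ("v", "other_v")]))
```

Whether 3.0 should drop duplicates like this or reject them with a clear error is exactly the cleanup the ticket asks for; either is friendlier than an {{AssertionError}}.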
[jira] [Updated] (CASSANDRA-9528) Improve log output from unit tests
[ https://issues.apache.org/jira/browse/CASSANDRA-9528?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9528:
--
Fix Version/s: (was: 3.0 beta 1) 3.0.x

Improve log output from unit tests
--
Key: CASSANDRA-9528
URL: https://issues.apache.org/jira/browse/CASSANDRA-9528
Project: Cassandra
Issue Type: Test
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
Fix For: 3.0.x

* Single log output file per suite
* stdout/stderr to the same log file with proper interleaving
* Don't interleave interactive output from unit tests run concurrently to the console. Print everything about the test once the test has completed.
* Fetch and compress log files as part of artifacts collected by cassci

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-6237) Allow range deletions in CQL
[ https://issues.apache.org/jira/browse/CASSANDRA-6237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-6237:
--
Fix Version/s: (was: 3.0 beta 1) 3.0.0 rc1

Allow range deletions in CQL
Key: CASSANDRA-6237
URL: https://issues.apache.org/jira/browse/CASSANDRA-6237
Project: Cassandra
Issue Type: Improvement
Reporter: Sylvain Lebresne
Assignee: Benjamin Lerer
Priority: Minor
Labels: cql, docs
Fix For: 3.0.0 rc1
Attachments: CASSANDRA-6237.txt

We use RangeTombstones internally in a number of places, but we could expose them more directly too. Typically, given a table like:

{noformat}
CREATE TABLE events (
    id text,
    created_at timestamp,
    content text,
    PRIMARY KEY (id, created_at)
)
{noformat}

we could allow queries like:

{noformat}
DELETE FROM events WHERE id='someEvent' AND created_at <= 'Jan 3, 2013';
{noformat}

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-9505) Expose sparse formatting via JMX and/or sstablemetadata
[ https://issues.apache.org/jira/browse/CASSANDRA-9505?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9505: -- Fix Version/s: (was: 3.0 beta 1) 3.0.0 rc1 Expose sparse formatting via JMX and/or sstablemetadata --- Key: CASSANDRA-9505 URL: https://issues.apache.org/jira/browse/CASSANDRA-9505 Project: Cassandra Issue Type: Improvement Reporter: Jim Witschey Assignee: Sylvain Lebresne Fix For: 3.0.0 rc1 It'd be helpful for us in TE if we could differentiate between data written in the sparse and dense formats as described [here|https://github.com/pcmanus/cassandra/blob/8099/guide_8099.md#storage-format-on-disk-and-on-wire]. It'd help us to measure speed and space performance and to make sure the format is chosen correctly and consistently. I don't know if this would be best exposed through a JMX endpoint, {{sstablemetadata}}, or both, but those seem like the most obvious exposure points. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (CASSANDRA-9554) Avoid digest mismatch storm on upgrade to 3.0
[ https://issues.apache.org/jira/browse/CASSANDRA-9554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9554:
--
Fix Version/s: (was: 3.0 beta 1) 3.0.0 rc1

Avoid digest mismatch storm on upgrade to 3.0
Key: CASSANDRA-9554
URL: https://issues.apache.org/jira/browse/CASSANDRA-9554
Project: Cassandra
Issue Type: Bug
Reporter: Aleksey Yeschenko
Assignee: Tyler Hobbs
Fix For: 3.0.0 rc1

CASSANDRA-8099, in {{UnfilteredRowIterators.digest()}}:

{code}
// TODO: we're not computing digest the same way that old nodes. This
// means we'll have digest mismatches during upgrade. We should pass the messaging version of
// the node this is for (which might mean computing the digest last, and won't work
// for schema (where we announce the version through gossip to everyone))
{code}

In a mixed 2.1 (2.2) - 3.0 cluster, we need to calculate both digests at the same time, keep both results, and send the appropriate one depending on the receiving node's messaging version. Do that until {{MessagingService.allNodesAtLeast30()}} is true (this is not unprecedented).

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
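The "compute both, send the right one" proposal can be sketched as follows. This is a Python illustration only: the two hash functions merely stand in for the pre-3.0 and 3.0 digest algorithms, and `MS_VERSION_30` is an invented constant, not Cassandra's real messaging-version id:

```python
import hashlib

MS_VERSION_30 = 10  # illustrative threshold for "node speaks 3.0"

def compute_digests(partition_bytes):
    # One pass over the data, both digest forms retained, so a read
    # coordinator can answer old and new replicas without re-reading.
    return {
        "legacy": hashlib.md5(partition_bytes).digest(),
        "current": hashlib.sha256(partition_bytes).digest(),
    }

def digest_for(digests, receiver_version):
    """Pick the digest matching the receiving node's messaging version."""
    if receiver_version >= MS_VERSION_30:
        return digests["current"]
    return digests["legacy"]
```

Once every node reports at least the 3.0 version (the {{allNodesAtLeast30()}} check), the legacy computation can be dropped entirely.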
[jira] [Updated] (CASSANDRA-9650) CRC32Factory hack can be removed in trunk
[ https://issues.apache.org/jira/browse/CASSANDRA-9650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Ellis updated CASSANDRA-9650:
--
Fix Version/s: (was: 3.0 beta 1) 3.0.0 rc1

CRC32Factory hack can be removed in trunk
-
Key: CASSANDRA-9650
URL: https://issues.apache.org/jira/browse/CASSANDRA-9650
Project: Cassandra
Issue Type: Improvement
Components: Core
Reporter: Benedict
Priority: Minor
Fix For: 3.0.0 rc1

Since we now require Java 8, we can remove the hack for compiling on earlier VMs, and in fact remove PureJavaCRC32 altogether.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (CASSANDRA-9522) Specify unset column ratios in cassandra-stress write
[ https://issues.apache.org/jira/browse/CASSANDRA-9522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605077#comment-14605077 ] Jonathan Ellis commented on CASSANDRA-9522:
---

/cc [~mambocab]

Specify unset column ratios in cassandra-stress write
-
Key: CASSANDRA-9522
URL: https://issues.apache.org/jira/browse/CASSANDRA-9522
Project: Cassandra
Issue Type: Improvement
Components: Tools
Reporter: Jim Witschey
Assignee: T Jake Luciani
Fix For: 3.0 beta 1

I'd like to be able to use stress to generate workloads with different distributions of unset columns -- so, for instance, you could specify that rows will have 70% unset columns, and on average a 100-column row would contain only 30 values. This would help us test the new row formats introduced in 8099. There are 2 different row formats, used depending on the ratio of set to unset columns, and this feature would let us generate workloads that would be stored in each of those formats.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)
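The requested knob — e.g. 70% unset columns — can be modeled as an independent Bernoulli draw per column, which is one plausible way a stress generator could implement it. A sketch (not cassandra-stress code; `None` marks an unset column, and the value range is arbitrary):

```python
import random

def generate_row(n_columns, unset_ratio, rng):
    """Each column is independently unset with probability unset_ratio."""
    return [None if rng.random() < unset_ratio else rng.randint(0, 999)
            for _ in range(n_columns)]

rng = random.Random(42)  # fixed seed so the generated workload is reproducible
rows = [generate_row(100, 0.70, rng) for _ in range(1000)]

set_cols = sum(v is not None for row in rows for v in row)
# With unset_ratio=0.70, roughly 30 of every 100 columns carry a value.
print(set_cols / (1000 * 100))
```

Sweeping `unset_ratio` across runs would exercise both of the 8099 row formats, since the format choice depends on the set/unset ratio.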
[jira] [Commented] (CASSANDRA-9658) Re-enable memory-mapped index file reads on Windows
[ https://issues.apache.org/jira/browse/CASSANDRA-9658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605085#comment-14605085 ] Stefania commented on CASSANDRA-9658:
-

I've run an additional test on cperf and here are [the results|http://cstar.datastax.com/graph?stats=9de58a92-1e0a-11e5-bede-42010af0688f&metric=op_rate&operation=2_read&smoothing=1&show_aggregates=true&xmin=0&xmax=198.11&ymin=0&ymax=270914.6] on [blade_11_b|http://cstar.datastax.com/cluster/specs]. The difference between standard and mmap on trunk is about 55k (229,213 vs 175,327), confirming what was already observed in the previous tests. However 8894 reduces the difference somewhat (230,555 vs 207,208). What was the difference when you last tested?

The 8894 branch is based on trunk but has the latest page alignment optimizations, CASSANDRA-8894, which are dependent on the page-aligned buffers, CASSANDRA-8897, already on trunk but not in 2.2. I'm happy to spend more time to see if there are further optimizations to reduce this difference, or to fix any regressions that contributed to increasing it in the first place.

The cleanup ticket that removes temporary descriptors, CASSANDRA-7066, is actually targeted to trunk only, not 2.2. Is this the ticket we need to re-enable mmap on Windows (I seem to recall this is the case from a comment posted there) or are CASSANDRA-8893 and CASSANDRA-8984 sufficient?

Re-enable memory-mapped index file reads on Windows
---
Key: CASSANDRA-9658
URL: https://issues.apache.org/jira/browse/CASSANDRA-9658
Project: Cassandra
Issue Type: Improvement
Reporter: Joshua McKenzie
Assignee: Joshua McKenzie
Labels: Windows, performance
Fix For: 2.2.x

It appears that the impact of buffered vs. memory-mapped index file reads has changed dramatically since last I tested. [Here's some results on various platforms we pulled together yesterday w/2.2-HEAD|https://docs.google.com/spreadsheets/d/1JaO2x7NsK4SSg_ZBqlfH0AwspGgIgFZ9wZ12fC4VZb0/edit#gid=0].

TL;DR: On Linux we see a 40% hit in performance from 108k ops/sec on reads to 64.8k ops/sec. While surprising in itself, the really unexpected result (to me) is on Windows - with standard access we're getting 16.8k ops/second on our bare-metal perf boxes vs. 184.7k ops/sec with memory-mapped index files, an over 10-fold increase in throughput. While testing w/standard access, CPUs on the stress machine and C* node are both sitting at around 4%, network doesn't appear bottlenecked, resource monitor doesn't show anything interesting, and performance counters in the kernel show very little. Changes in thread count simply serve to increase median latency w/out impacting any other visible metric that we're measuring, so I'm at a loss as to why the disparity is so huge on the platform.

The combination of my changes to get the 2.1 branch to behave on Windows, along with [~benedict] and [~Stefania]'s changes in lifecycle and cleanup patterns on 2.2, should hopefully have us in a state where transitioning back to using memory-mapped I/O on Windows will only cause trouble on snapshot deletion. Fairly simple runs of stress w/compaction aren't popping up any obvious errors on file access or renaming - I'm going to do some much heavier testing (ccm multi-node clusters, long stress w/repair and compaction, etc.) and see if there are any outstanding issues that need to be stamped out to call mmap'ed index files on Windows safe.

The one thing we'll never be able to support is deletion of snapshots while a node is running and sstables are mapped, but for a 10x throughput increase I think users would be willing to make that sacrifice.

The combination of the powercfg profile change, the kernel timer resolution, and memory-mapped index files is giving some pretty interesting performance numbers on EC2.

-- This message was sent by Atlassian JIRA (v6.3.4#6332)