[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605351#comment-14605351
 ] 

Benedict commented on CASSANDRA-9318:
-

Of course, things get even hairier with multi-DC, but I'm not as familiar with 
the logic there. Naively, it looks as though a single node could quickly bring 
down every DC.

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
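
A minimal sketch of the watermark scheme described above (illustrative only; the class and method names are hypothetical, not Cassandra's actual implementation):

```java
// Hypothetical byte-counting gate: pause reads from client connections at a
// high watermark of in-flight request bytes, resume once back below a low one.
public class InFlightLimiter
{
    private final long highWatermark;
    private final long lowWatermark;
    private long inFlightBytes = 0;
    private boolean readsPaused = false;

    public InFlightLimiter(long highWatermark, long lowWatermark)
    {
        this.highWatermark = highWatermark;
        this.lowWatermark = lowWatermark;
    }

    public synchronized void onRequestStart(long bytes)
    {
        inFlightBytes += bytes;
        if (inFlightBytes >= highWatermark)
            readsPaused = true;   // stop reading from client connections
    }

    public synchronized void onRequestComplete(long bytes)
    {
        inFlightBytes -= bytes;
        if (readsPaused && inFlightBytes <= lowWatermark)
            readsPaused = false;  // resume reads below the low watermark
    }

    public synchronized boolean readsPaused()
    {
        return readsPaused;
    }
}
```

The gap between the two watermarks provides hysteresis, so reads are not toggled on and off around a single threshold.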



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9672) Provide a per-table param that would force default ttl on all updates

2015-06-29 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605477#comment-14605477
 ] 

Sylvain Lebresne commented on CASSANDRA-9672:
-

I'm bikeshedding, but one slightly different alternative could be to add a 
{{minimum_data_retention_time}} that would guarantee the minimum time data 
lives in the database. An advantage is that it would still allow different ttls 
(and in particular mixing data with a ttl and data without one) but with the 
guarantee that you can't fat-finger a ttl too low, or delete data by mistake, 
which feels to me closer to what a DBA would intend. I also have a hunch that 
it's easier to explain why that kind of option forces us to refuse deletes, 
but well, that's just a hunch. It should also be fairly intuitive to reserve a 
special value (say, a negative one) for that option to mean keep all data 
forever.

Of course, such an option would still allow us to drop sstables cheaply 
(technically as long as the min retention is lower than gcGrace, but you can 
lower gcGrace if need be).


 Provide a per-table param that would force default ttl on all updates
 -

 Key: CASSANDRA-9672
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9672
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Priority: Minor

 Many users have tables that rely on TTL entirely - no deletes, and only fixed 
 TTL value.
 The way that default ttl works now, we only apply it if none is specified.
 We should provide an option that would *enforce* the specified TTL: not 
 allowing ttl-less {{INSERT}} or {{UPDATE}}, not allowing a ttl that's lower or 
 higher than the default ttl, and not allowing deletes.
 That option, when enabled ({{force_default_ttl}}), should allow us to drop more 
 sstables during compaction and do so more cheaply. It would also allow DBAs to 
 enforce the constraint in a guaranteed manner.
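
A minimal sketch of what the proposed {{force_default_ttl}} check could look like on the write path (hypothetical names, not actual Cassandra code; assumes the strict reading above, where an explicit ttl must equal the table default and deletes are refused):

```java
// Hypothetical write-path validation for a table with force_default_ttl enabled.
public class TtlEnforcer
{
    private final int defaultTtlSeconds;

    public TtlEnforcer(int defaultTtlSeconds)
    {
        this.defaultTtlSeconds = defaultTtlSeconds;
    }

    /**
     * @param ttlSeconds the TTL carried by the statement, or null if no USING TTL was given
     * @param isDelete   whether the statement is a DELETE
     */
    public void validate(Integer ttlSeconds, boolean isDelete)
    {
        if (isDelete)
            throw new IllegalArgumentException("DELETE is not allowed when force_default_ttl is set");
        if (ttlSeconds != null && ttlSeconds != defaultTtlSeconds)
            throw new IllegalArgumentException("TTL must equal the table default of " + defaultTtlSeconds + "s");
    }
}
```

A ttl-less INSERT or UPDATE would simply pick up the default ttl and pass; only an explicit non-default ttl or a delete is rejected.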





[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605600#comment-14605600
 ] 

Benedict commented on CASSANDRA-9318:
-

That said, in general (perhaps in a separate ticket) we should probably make 
our heap calculations a bit more robust wrt each other. i.e. we should subtract 
memtable space from any heap apportionment, in case users set memtable space 
really high.






[jira] [Created] (CASSANDRA-9675) BulkLoader has --transport-factory option but does not use it

2015-06-29 Thread Mike Adamson (JIRA)
Mike Adamson created CASSANDRA-9675:
---

 Summary: BulkLoader has --transport-factory option but does not 
use it
 Key: CASSANDRA-9675
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9675
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Mike Adamson
Assignee: Mike Adamson
 Fix For: 2.2.x


The BulkLoader tool was converted to use the native driver but still has a 
--transport-factory option.





[jira] [Updated] (CASSANDRA-9675) BulkLoader has --transport-factory option but does not use it

2015-06-29 Thread Mike Adamson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Adamson updated CASSANDRA-9675:

Priority: Minor  (was: Major)

 BulkLoader has --transport-factory option but does not use it
 -

 Key: CASSANDRA-9675
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9675
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Mike Adamson
Assignee: Mike Adamson
Priority: Minor
 Fix For: 2.2.x

 Attachments: 9675.txt


 The BulkLoader tool was converted to use the native driver but still has a 
 --transport-factory option.





[jira] [Updated] (CASSANDRA-9675) BulkLoader has --transport-factory option but does not use it

2015-06-29 Thread Mike Adamson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9675?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mike Adamson updated CASSANDRA-9675:

Attachment: 9675.txt

 BulkLoader has --transport-factory option but does not use it
 -

 Key: CASSANDRA-9675
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9675
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
Reporter: Mike Adamson
Assignee: Mike Adamson
 Fix For: 2.2.x

 Attachments: 9675.txt


 The BulkLoader tool was converted to use the native driver but still has a 
 --transport-factory option.





[1/2] cassandra git commit: BulkLoader has --transport-factory option but does not use it

2015-06-29 Thread jasobrown
Repository: cassandra
Updated Branches:
  refs/heads/trunk e211008d5 -> 5c31a8633


BulkLoader has --transport-factory option but does not use it

patch by Mike Adamson; reviewed by jasobrown for CASSANDRA-9675


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f88b6211
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f88b6211
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f88b6211

Branch: refs/heads/trunk
Commit: f88b62118dd9d3f08bc079bc15165fae01519537
Parents: bafcb3a
Author: Jason Brown jasedbr...@gmail.com
Authored: Mon Jun 29 07:10:53 2015 -0700
Committer: Jason Brown jasedbr...@gmail.com
Committed: Mon Jun 29 07:10:53 2015 -0700

--
 CHANGES.txt | 1 +
 src/java/org/apache/cassandra/tools/BulkLoader.java | 4 ----
 2 files changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/f88b6211/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index e58d524..fe71ea7 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 2.2
+ * BulkLoader has --transport-factory option but does not use it 
(CASSANDRA-9675)
  * Allow JMX over SSL directly from nodetool (CASSANDRA-9090)
  * Update cqlsh for UDFs (CASSANDRA-7556)
  * Change Windows kernel default timer resolution (CASSANDRA-9634)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f88b6211/src/java/org/apache/cassandra/tools/BulkLoader.java
--
diff --git a/src/java/org/apache/cassandra/tools/BulkLoader.java 
b/src/java/org/apache/cassandra/tools/BulkLoader.java
index 51e5e3d..73194a1 100644
--- a/src/java/org/apache/cassandra/tools/BulkLoader.java
+++ b/src/java/org/apache/cassandra/tools/BulkLoader.java
@@ -24,7 +24,6 @@ import java.net.MalformedURLException;
 import java.net.UnknownHostException;
 import java.util.*;
 
-import com.google.common.base.Optional;
 import com.google.common.collect.HashMultimap;
 import com.google.common.collect.Multimap;
 import org.apache.commons.cli.*;
@@ -53,8 +52,6 @@ public class BulkLoader
     private static final String PASSWD_OPTION = "password";
     private static final String THROTTLE_MBITS = "throttle";

-    private static final String TRANSPORT_FACTORY = "transport-factory";
-
     /* client encryption options */
     private static final String SSL_TRUSTSTORE = "truststore";
     private static final String SSL_TRUSTSTORE_PW = "truststore-password";
@@ -516,7 +513,6 @@ public class BulkLoader
         options.addOption("t",  THROTTLE_MBITS, "throttle", "throttle speed in Mbits (default unlimited)");
         options.addOption("u",  USER_OPTION, "username", "username for cassandra authentication");
         options.addOption("pw", PASSWD_OPTION, "password", "password for cassandra authentication");
-        options.addOption("tf", TRANSPORT_FACTORY, "transport factory", "Fully-qualified ITransportFactory class name for creating a connection to cassandra");
         options.addOption("cph", CONNECTIONS_PER_HOST, "connectionsPerHost", "number of concurrent connections-per-host.");
         // ssl connection-related options
         options.addOption("ts", SSL_TRUSTSTORE, "TRUSTSTORE", "Client SSL: full path to truststore");



[jira] [Resolved] (CASSANDRA-8298) cassandra-stress legacy

2015-06-29 Thread T Jake Luciani (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8298?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

T Jake Luciani resolved CASSANDRA-8298.
---
Resolution: Duplicate

This should happen as part of CASSANDRA-8986

  cassandra-stress legacy
 

 Key: CASSANDRA-8298
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8298
 Project: Cassandra
  Issue Type: Bug
  Components: Tools
 Environment: Centos 6.5 Cassandra 2.1.1
Reporter: Edgardo Vega
Assignee: T Jake Luciani

 Running cassandra-stress legacy failed immediately with an error.
 Running in legacy support mode. Translating command to:
 stress write n=100 -col n=fixed(5) size=fixed(34) data=repeat(1) -rate 
 threads=50 -log interval=10 -mode thrift
 Invalid parameter data=repeat(1)





[jira] [Resolved] (CASSANDRA-8597) Stress: make simple things simple

2015-06-29 Thread T Jake Luciani (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

T Jake Luciani resolved CASSANDRA-8597.
---
Resolution: Duplicate

Closing in favor of CASSANDRA-8986

 Stress: make simple things simple
 -

 Key: CASSANDRA-8597
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8597
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Jonathan Ellis
Assignee: T Jake Luciani
 Fix For: 2.1.x


 Some of the trouble people have with stress is a documentation problem, but 
 some is functional.
 Comments from [~iamaleksey]:
 # 3 clustering columns, make a million cells in a single partition, should be 
 simple, but it's not. have to tweak 'clustering' on the three columns just 
 right to make stress work at all. w/ some values it'd just get stuck forever 
 computing batches
 # for others, it generates huge, megabyte-size batches, utterly disrespecting 
 'select' clause in 'insert'
 #  I want a sequential generator too, to be able to predict deterministic 
 result sets. uniform() only gets you so far
 # impossible to simulate a time series workload
 /cc [~jshook] [~aweisberg] [~benedict]





[jira] [Commented] (CASSANDRA-8696) nodetool repair on cassandra 2.1.2 keyspaces return java.lang.RuntimeException: Could not create snapshot

2015-06-29 Thread A Markov (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8696?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605603#comment-14605603
 ] 

A Markov commented on CASSANDRA-8696:
-

Yuki, I am not sure that increasing the timeout to 1 hour is a good solution. We 
are running 2.1.7 and getting into a situation where repair stops completely for 
an hour. I might be wrong, but it looks like repair doesn't start another 
session until all tasks of the current session are finished one way or another. 
So if one of the tasks of the current session fails without an immediate message, 
in our example exactly the same error about a failed snapshot

 RepairJob.java:145 - Error occurred during snapshot phase

repair just idles for an hour, resuming its work after processing that 
exception. As a result, the system could not finish repair in a realistic time 
(still working after 7 days).

 nodetool repair on cassandra 2.1.2 keyspaces return 
 java.lang.RuntimeException: Could not create snapshot
 -

 Key: CASSANDRA-8696
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8696
 Project: Cassandra
  Issue Type: Bug
Reporter: Jeff Liu
Assignee: Yuki Morishita
 Fix For: 2.1.x


 When trying to run nodetool repair -pr on a cassandra node (2.1.2), cassandra 
 throws java exceptions: cannot create snapshot.
 the error log from system.log:
 {noformat}
 INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:28,815 
 StreamResultFuture.java:166 - [Stream #692c1450-a692-11e4-9973-070e938df227 
 ID#0] Prepare completed. Receiving 2 files(221187 bytes), sending 5 
 files(632105 bytes)
 INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
 StreamResultFuture.java:180 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
 Session with /10.97.9.110 is complete
 INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,046 
 StreamResultFuture.java:212 - [Stream #692c1450-a692-11e4-9973-070e938df227] 
 All sessions completed
 INFO  [STREAM-IN-/10.97.9.110] 2015-01-28 02:07:29,047 
 StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
 streaming task succeed, returning response to /10.98.194.68
 INFO  [RepairJobTask:1] 2015-01-28 02:07:29,065 StreamResultFuture.java:86 - 
 [Stream #692c6270-a692-11e4-9973-070e938df227] Executing streaming plan for 
 Repair
 INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,065 
 StreamSession.java:213 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
 Starting streaming to /10.66.187.201
 INFO  [StreamConnectionEstablisher:4] 2015-01-28 02:07:29,070 
 StreamCoordinator.java:209 - [Stream #692c6270-a692-11e4-9973-070e938df227, 
 ID#0] Beginning stream session with /10.66.187.201
 INFO  [STREAM-IN-/10.66.187.201] 2015-01-28 02:07:29,465 
 StreamResultFuture.java:166 - [Stream #692c6270-a692-11e4-9973-070e938df227 
 ID#0] Prepare completed. Receiving 5 files(627994 bytes), sending 5 
 files(632105 bytes)
 INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,971 
 StreamResultFuture.java:180 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
 Session with /10.66.187.201 is complete
 INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
 StreamResultFuture.java:212 - [Stream #692c6270-a692-11e4-9973-070e938df227] 
 All sessions completed
 INFO  [StreamReceiveTask:22] 2015-01-28 02:07:31,972 
 StreamingRepairTask.java:96 - [repair #685e3d00-a692-11e4-9973-070e938df227] 
 streaming task succeed, returning response to /10.98.194.68
 ERROR [RepairJobTask:1] 2015-01-28 02:07:39,444 RepairJob.java:127 - Error 
 occurred during snapshot phase
 java.lang.RuntimeException: Could not create snapshot at /10.97.9.110
 at 
 org.apache.cassandra.repair.SnapshotTask$SnapshotCallback.onFailure(SnapshotTask.java:77)
  ~[apache-cassandra-2.1.2.jar:2.1.2]
 at 
 org.apache.cassandra.net.MessagingService$5$1.run(MessagingService.java:347) 
 ~[apache-cassandra-2.1.2.jar:2.1.2]
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471) 
 ~[na:1.7.0_45]
 at java.util.concurrent.FutureTask.run(FutureTask.java:262) 
 ~[na:1.7.0_45]
 at 
 java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
  [na:1.7.0_45]
 at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
  [na:1.7.0_45]
 at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
 INFO  [AntiEntropySessions:6] 2015-01-28 02:07:39,445 RepairSession.java:260 
 - [repair #6f85e740-a692-11e4-9973-070e938df227] new session: will sync 
 /10.98.194.68, /10.66.187.201, /10.226.218.135 on range 
 (12817179804668051873746972069086
 2638799,12863540308359254031520865977436165] for events.[bigint0text, 
 bigint0boolean, bigint0int, dataset_catalog, column_categories, 
 bigint0double, 

[jira] [Assigned] (CASSANDRA-9601) Allow an initial connection timeout to be set in cqlsh

2015-06-29 Thread Benjamin Lerer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer reassigned CASSANDRA-9601:
-

Assignee: Benjamin Lerer  (was: Stefania)

 Allow an initial connection timeout to be set in cqlsh
 --

 Key: CASSANDRA-9601
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9601
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Mike Adamson
Assignee: Benjamin Lerer
  Labels: cqlsh
 Fix For: 2.2.x


 [PYTHON-206|https://datastax-oss.atlassian.net/browse/PYTHON-206] introduced 
 the ability to change the initial connection timeout on connections from the 
 default of 5s.
 This change was introduced because some auth providers (kerberos) can take 
 longer than 5s to complete a first time negotiation for a connection. 
 cqlsh should allow this setting to be changed. 





[2/2] cassandra git commit: Merge branch 'cassandra-2.2' into trunk

2015-06-29 Thread jasobrown
Merge branch 'cassandra-2.2' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/5c31a863
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/5c31a863
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/5c31a863

Branch: refs/heads/trunk
Commit: 5c31a8633485493fec392ebcb9d950b57745e456
Parents: e211008 f88b621
Author: Jason Brown jasedbr...@gmail.com
Authored: Mon Jun 29 07:12:13 2015 -0700
Committer: Jason Brown jasedbr...@gmail.com
Committed: Mon Jun 29 07:12:13 2015 -0700

--
 CHANGES.txt | 1 +
 src/java/org/apache/cassandra/tools/BulkLoader.java | 4 ----
 2 files changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/5c31a863/CHANGES.txt
--
diff --cc CHANGES.txt
index ff80121,fe71ea7..206b15d
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@@ -1,17 -1,5 +1,18 @@@
 +3.0:
 + * Improve log output from unit tests (CASSANDRA-9528)
 + * Add algorithmic token allocation (CASSANDRA-7032)
 + * Add nodetool command to replay batchlog (CASSANDRA-9547)
 + * Make file buffer cache independent of paths being read (CASSANDRA-8897)
 + * Remove deprecated legacy Hadoop code (CASSANDRA-9353)
 + * Decommissioned nodes will not rejoin the cluster (CASSANDRA-8801)
 + * Change gossip stabilization to use endpoit size (CASSANDRA-9401)
 + * Change default garbage collector to G1 (CASSANDRA-7486)
 + * Populate TokenMetadata early during startup (CASSANDRA-9317)
 + * undeprecate cache recentHitRate (CASSANDRA-6591)
 +
 +
  2.2
+  * BulkLoader has --transport-factory option but does not use it 
(CASSANDRA-9675)
   * Allow JMX over SSL directly from nodetool (CASSANDRA-9090)
   * Update cqlsh for UDFs (CASSANDRA-7556)
   * Change Windows kernel default timer resolution (CASSANDRA-9634)



[jira] [Created] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Vladimir Kuptsov (JIRA)
Vladimir Kuptsov created CASSANDRA-9676:
---

 Summary: CQLSSTableWriter gives java.lang.AssertionError: Empty 
partition in C* 2.0.15
 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov


I have the same issue as described in 
https://issues.apache.org/jira/browse/CASSANDRA-9071
As I understand it, this happens during the buffer flush, which is regulated by 
the withBufferSizeInMB() method call in
{code}
CQLSSTableWriter
    .builder()
    .inDirectory(createOutputDir())
    .forTable(metadata.schema)
    .using(insertStatement)
    .withBufferSizeInMB(128)
    .build()
{code}
For example, when I use a 128 MB buffer, it fails after 210,000 csv lines are 
processed. With a 3 MB buffer it fails after 10,000 lines.





[jira] [Commented] (CASSANDRA-9636) Duplicate columns in selection causes AssertionError

2015-06-29 Thread Benjamin Lerer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9636?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605548#comment-14605548
 ] 

Benjamin Lerer commented on CASSANDRA-9636:
---

I noticed an issue with {{count(*)}} queries. 
Up to 2.2, the count function was implemented in a different way than the other 
functions. It was some form of hack in {{SelectStatement}}. Due to that, the 
mapping returned is wrong for this function.

To be on the safe side, I think it would be good to add some tests for duplicate 
function calls and, for 2.2, some tests with aggregations. 


 Duplicate columns in selection causes AssertionError
 

 Key: CASSANDRA-9636
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9636
 Project: Cassandra
  Issue Type: Bug
Reporter: Sam Tunnicliffe
Assignee: Sam Tunnicliffe
 Fix For: 2.1.x, 2.0.x


 Prior to CASSANDRA-9532, unaliased duplicate fields in a selection would be 
 silently ignored. Now, they trigger a server side exception and an unfriendly 
 error response, which we should clean up. Duplicate columns *with* aliases 
 are not affected.
 {code}
 CREATE KEYSPACE ks WITH replication = {'class': 'SimpleStrategy', 
 'replication_factor': 1};
 CREATE TABLE ks.t2 (k int PRIMARY KEY, v int);
 INSERT INTO ks.t2 (k, v) VALUES (0, 0);
 SELECT k, v FROM ks.t2;
 SELECT k, v, v AS other_v FROM ks.t2;
 SELECT k, v, v FROM ks.t2;
 {code}
 The final statement results in this error response & server-side stacktrace:
 {code}
 ServerError: ErrorMessage code= [Server error] 
 message=java.lang.AssertionError
 ERROR 13:01:30 Unexpected exception during request; channel = [id: 
 0x44d22e61, /127.0.0.1:39463 => /127.0.0.1:9042]
 java.lang.AssertionError: null
 at org.apache.cassandra.cql3.ResultSet.addRow(ResultSet.java:63) 
 ~[main/:na]
 at 
 org.apache.cassandra.cql3.statements.Selection$ResultSetBuilder.build(Selection.java:355)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.process(SelectStatement.java:1226)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.processResults(SelectStatement.java:299)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:238)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:67)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
  ~[main/:na]
 at 
 org.apache.cassandra.cql3.QueryProcessor.process(QueryProcessor.java:260) 
 ~[main/:na]
 at 
 org.apache.cassandra.transport.messages.QueryMessage.execute(QueryMessage.java:119)
  ~[main/:na]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
  [main/:na]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
  [main/:na]
 at 
 io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) 
 [na:1.8.0_45]
 at 
 org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
  [main/:na]
 at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
 [main/:na]
 at java.lang.Thread.run(Thread.java:745) [na:1.8.0_45]
 {code}
 This issue also presents on the head of the 2.2 branch and on 2.0.16. 
 However, the prior behaviour is different on both of those branches.
 In the 2.0 line prior to CASSANDRA-9532, duplicate columns would actually be 
 included in the results, as opposed to being silently dropped as per 2.1.x
 In 2.2, the assertion error seen above precedes CASSANDRA-9532 and is also 
 triggered for both aliased and unaliased duplicate columns.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9606) this query is not supported in new version

2015-06-29 Thread Benjamin Lerer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1460#comment-1460
 ] 

Benjamin Lerer edited comment on CASSANDRA-9606 at 6/29/15 12:48 PM:
-

[~thobbs] could you review?


was (Author: blerer):
@Tyler could you review?

 this query is not supported in new version
 --

 Key: CASSANDRA-9606
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9606
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: cassandra 2.1.6
 jdk 1.7.0_55
Reporter: zhaoyan
Assignee: Benjamin Lerer
 Attachments: 9606-2.0.txt, 9606-2.1.txt, 9606-2.2.txt


 Background:
 1. create a table:
 {code}
 CREATE TABLE test (
     a int,
     b int,
     c int,
     d int,
     PRIMARY KEY (a, b, c)
 );
 {code}
 2. query by a=1 and b < 6
 {code}
 select * from test where a=1 and b < 6;
  a | b | c | d
 ---+---+---+---
  1 | 3 | 1 | 2
  1 | 3 | 2 | 2
  1 | 3 | 4 | 2
  1 | 3 | 5 | 2
  1 | 4 | 4 | 2
  1 | 5 | 5 | 2
 (6 rows)
 {code}
 3. query by page
 first page:
 {code}
 select * from test where a=1 and b < 6 limit 2;
  a | b | c | d
 ---+---+---+---
  1 | 3 | 1 | 2
  1 | 3 | 2 | 2
 (2 rows)
 {code}
 second page:
 {code}
 select * from test where a=1 and b < 6 and (b,c) > (3,2) limit 2;
  a | b | c | d
 ---+---+---+---
  1 | 3 | 4 | 2
  1 | 3 | 5 | 2
 (2 rows)
 {code}
 last page:
 {code}
 select * from test where a=1 and b < 6 and (b,c) > (3,5) limit 2;
  a | b | c | d
 ---+---+---+---
  1 | 4 | 4 | 2
  1 | 5 | 5 | 2
 (2 rows)
 {code}
 question:
 this paging query worked in cassandra 2.0.8,
 but it is not supported in the latest version 2.1.6.
 when executing:
 {code}
 select * from test where a=1 and b < 6 and (b,c) > (3,2) limit 2;
 {code}
 we get this error message:
 InvalidRequest: code=2200 [Invalid query] message=Column b cannot have 
 both tuple-notation inequalities and single-column inequalities: (b, c) > (3, 2)





[jira] [Commented] (CASSANDRA-9606) this query is not supported in new version

2015-06-29 Thread Benjamin Lerer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9606?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=1460#comment-1460
 ] 

Benjamin Lerer commented on CASSANDRA-9606:
---

@Tyler could you review?






[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605597#comment-14605597
 ] 

Benedict commented on CASSANDRA-9318:
-

bq. default timeout is 2s not 10, so actually fine in your example of 300MB vs 
150MB/s x 2s

It looks like in 2.0 this was 10s, and it was hard-coded in the yaml, so anyone 
upgrading from 2.0 or before likely has a 10s timeout. So we should assume this 
is by far the most common timeout.

bq. you don't see a complete halt until capacity's worth of requests timeout 
all at once, because you don't get an entire capacity load accepted at once. 
it's more continuous than discrete – you pause until the oldest expire, accept 
more, pause until the oldest expire, etc. so you make slow progress as load 
shedding can free up memory. thus, load shedding is complementary to flow 
control.

You see a complete halt as soon as we exhaust space. If we exhaust space in 
less than 0.5x the timeout, then we will see repeated juddering behaviour.

bq. but we can easily set a higher limit on MS heap – maybe as high as 1/8 heap 
as default which gives us a lot of room for 8GB heap

If we set this really _aggressively_ high, say min(1/4 heap, 1Gb) until we 
implement the improved shedding, then I'll quit complaining. Right now we give 
breathing room up to and beyond collapse.  I absolutely agree that breathing 
room up until just-prior-to-collapse is preferable, but cutting our breathing 
room by a magnitude is reducing our availability in clusters without their 
opting into it. 1/4 heap is probably still leaving quite a lot of headroom we 
would otherwise have safely used in a 2Gb heap (which are quite feasible, and 
probably preferable, for many users running offheap memtables), but is still 
very unlikely to cause the server to completely collapse. 
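
The interim limit proposed above, min(1/4 heap, 1Gb), is simple enough to state directly (illustrative sketch; the class and method names are hypothetical):

```java
public class InFlightSizing
{
    // Interim in-flight request limit: a quarter of the heap, capped at 1 GiB.
    static long inFlightLimitBytes(long maxHeapBytes)
    {
        return Math.min(maxHeapBytes / 4, 1L << 30);
    }
}
```

For the 2Gb heap mentioned above this leaves 512MB of headroom; from 4Gb upwards the cap at 1GB applies.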


 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
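
 The high/low watermark scheme the description proposes can be sketched roughly 
 as follows. This is a hypothetical illustration, not Cassandra's actual code; 
 the class and method names are invented for the sketch:

```java
import java.util.concurrent.atomic.AtomicLong;

/**
 * Hypothetical sketch of the proposed coordinator back-pressure: track
 * outstanding request bytes and toggle reads on client connections around
 * a high/low watermark pair. Names are illustrative only.
 */
public class InflightLimiter {
    private final long highWatermark, lowWatermark;
    private final AtomicLong inflightBytes = new AtomicLong();
    private volatile boolean readsPaused = false;

    public InflightLimiter(long highWatermark, long lowWatermark) {
        this.highWatermark = highWatermark;
        this.lowWatermark = lowWatermark;
    }

    /** Account for an accepted request; pause reads past the high watermark. */
    public boolean onRequestStart(long bytes) {
        if (inflightBytes.addAndGet(bytes) >= highWatermark)
            readsPaused = true;          // disable read on client connections
        return !readsPaused;
    }

    /** Account for a completed (or shed) request; resume below the low watermark. */
    public void onRequestEnd(long bytes) {
        if (inflightBytes.addAndGet(-bytes) <= lowWatermark)
            readsPaused = false;         // re-enable reads
    }

    public boolean readsPaused() { return readsPaused; }

    public static void main(String[] args) {
        InflightLimiter limiter = new InflightLimiter(100, 50);
        limiter.onRequestStart(60);      // 60 bytes in flight: still reading
        limiter.onRequestStart(60);      // 120 >= 100: reads paused
        System.out.println(limiter.readsPaused());   // true
        limiter.onRequestEnd(80);        // 40 <= 50: reads resumed
        System.out.println(limiter.readsPaused());   // false
    }
}
```

 A real implementation would also have to confirm that pausing reads on the 
 client connection does not introduce other issues, which is exactly the caveat 
 the description raises.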



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9601) Allow an initial connection timeout to be set in cqlsh

2015-06-29 Thread Benjamin Lerer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer updated CASSANDRA-9601:
--
Assignee: Stefania  (was: Benjamin Lerer)
Reviewer: Benjamin Lerer

 Allow an initial connection timeout to be set in cqlsh
 --

 Key: CASSANDRA-9601
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9601
 Project: Cassandra
  Issue Type: Improvement
  Components: Core
Reporter: Mike Adamson
Assignee: Stefania
  Labels: cqlsh
 Fix For: 2.2.x


 [PYTHON-206|https://datastax-oss.atlassian.net/browse/PYTHON-206] introduced 
 the ability to change the initial connection timeout on connections from the 
 default of 5s.
 This change was introduced because some auth providers (kerberos) can take 
 longer than 5s to complete a first time negotiation for a connection. 
 cqlsh should allow this setting to be changed. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


cassandra git commit: fix idea files issue

2015-06-29 Thread jake
Repository: cassandra
Updated Branches:
  refs/heads/cassandra-2.2 75e85b961 -> bafcb3a56


fix idea files issue


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/bafcb3a5
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/bafcb3a5
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/bafcb3a5

Branch: refs/heads/cassandra-2.2
Commit: bafcb3a5689702b9441c6be1cf4c14fb6caf44f0
Parents: 75e85b9
Author: T Jake Luciani j...@apache.org
Authored: Mon Jun 29 09:52:57 2015 -0400
Committer: T Jake Luciani j...@apache.org
Committed: Mon Jun 29 09:52:57 2015 -0400

--
 build.xml | 5 +
 1 file changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/bafcb3a5/build.xml
--
diff --git a/build.xml b/build.xml
index 2eb2d89..8ca3122 100644
--- a/build.xml
+++ b/build.xml
@@ -1693,13 +1693,10 @@
path id=idea-project-libs-path
 fileset dir=lib
include name=**/*.jar /
- /fileset
+/fileset
 fileset dir=build/lib/jars
include name=**/*.jar /
 /fileset
-fileset dir=tools/lib
-include name=**/*.jar /
-/fileset
/path
 mkdir dir=.idea/
 mkdir dir=.idea/libraries/



[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605629#comment-14605629
 ] 

Jonathan Ellis commented on CASSANDRA-9318:
---

bq. If we set this really aggressively high, say min(1/4 heap, 1Gb) until we 
implement the improved shedding, then I'll quit complaining. 

Sold!

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


cassandra git commit: BulkLoader has --transport-factory option but does not use it

2015-06-29 Thread jasobrown
Repository: cassandra
Updated Branches:
  refs/heads/cassandra-2.2 bafcb3a56 -> f88b62118


BulkLoader has --transport-factory option but does not use it

patch by Mike Adamson; reviewed by jasobrown for CASSANDRA-9675


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/f88b6211
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/f88b6211
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/f88b6211

Branch: refs/heads/cassandra-2.2
Commit: f88b62118dd9d3f08bc079bc15165fae01519537
Parents: bafcb3a
Author: Jason Brown jasedbr...@gmail.com
Authored: Mon Jun 29 07:10:53 2015 -0700
Committer: Jason Brown jasedbr...@gmail.com
Committed: Mon Jun 29 07:10:53 2015 -0700

--
 CHANGES.txt | 1 +
 src/java/org/apache/cassandra/tools/BulkLoader.java | 4 
 2 files changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/f88b6211/CHANGES.txt
--
diff --git a/CHANGES.txt b/CHANGES.txt
index e58d524..fe71ea7 100644
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -1,4 +1,5 @@
 2.2
+ * BulkLoader has --transport-factory option but does not use it 
(CASSANDRA-9675)
  * Allow JMX over SSL directly from nodetool (CASSANDRA-9090)
  * Update cqlsh for UDFs (CASSANDRA-7556)
  * Change Windows kernel default timer resolution (CASSANDRA-9634)

http://git-wip-us.apache.org/repos/asf/cassandra/blob/f88b6211/src/java/org/apache/cassandra/tools/BulkLoader.java
--
diff --git a/src/java/org/apache/cassandra/tools/BulkLoader.java 
b/src/java/org/apache/cassandra/tools/BulkLoader.java
index 51e5e3d..73194a1 100644
--- a/src/java/org/apache/cassandra/tools/BulkLoader.java
+++ b/src/java/org/apache/cassandra/tools/BulkLoader.java
@@ -24,7 +24,6 @@ import java.net.MalformedURLException;
 import java.net.UnknownHostException;
 import java.util.*;
 
-import com.google.common.base.Optional;
 import com.google.common.collect.HashMultimap;
 import com.google.common.collect.Multimap;
 import org.apache.commons.cli.*;
@@ -53,8 +52,6 @@ public class BulkLoader
 private static final String PASSWD_OPTION = password;
 private static final String THROTTLE_MBITS = throttle;
 
-private static final String TRANSPORT_FACTORY = transport-factory;
-
 /* client encryption options */
 private static final String SSL_TRUSTSTORE = truststore;
 private static final String SSL_TRUSTSTORE_PW = truststore-password;
@@ -516,7 +513,6 @@ public class BulkLoader
 options.addOption(t,  THROTTLE_MBITS, throttle, throttle 
speed in Mbits (default unlimited));
 options.addOption(u,  USER_OPTION, username, username for 
cassandra authentication);
 options.addOption(pw, PASSWD_OPTION, password, password for 
cassandra authentication);
-options.addOption(tf, TRANSPORT_FACTORY, transport factory, 
Fully-qualified ITransportFactory class name for creating a connection to 
cassandra);
 options.addOption(cph, CONNECTIONS_PER_HOST, 
connectionsPerHost, number of concurrent connections-per-host.);
 // ssl connection-related options
 options.addOption(ts, SSL_TRUSTSTORE, TRUSTSTORE, Client SSL: 
full path to truststore);



[1/2] cassandra git commit: fix idea files issue

2015-06-29 Thread jake
Repository: cassandra
Updated Branches:
  refs/heads/trunk 4129c0b00 -> e211008d5


fix idea files issue


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/bafcb3a5
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/bafcb3a5
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/bafcb3a5

Branch: refs/heads/trunk
Commit: bafcb3a5689702b9441c6be1cf4c14fb6caf44f0
Parents: 75e85b9
Author: T Jake Luciani j...@apache.org
Authored: Mon Jun 29 09:52:57 2015 -0400
Committer: T Jake Luciani j...@apache.org
Committed: Mon Jun 29 09:52:57 2015 -0400

--
 build.xml | 5 +
 1 file changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/bafcb3a5/build.xml
--
diff --git a/build.xml b/build.xml
index 2eb2d89..8ca3122 100644
--- a/build.xml
+++ b/build.xml
@@ -1693,13 +1693,10 @@
path id=idea-project-libs-path
 fileset dir=lib
include name=**/*.jar /
- /fileset
+/fileset
 fileset dir=build/lib/jars
include name=**/*.jar /
 /fileset
-fileset dir=tools/lib
-include name=**/*.jar /
-/fileset
/path
 mkdir dir=.idea/
 mkdir dir=.idea/libraries/



[2/2] cassandra git commit: Merge branch 'cassandra-2.2' into trunk

2015-06-29 Thread jake
Merge branch 'cassandra-2.2' into trunk


Project: http://git-wip-us.apache.org/repos/asf/cassandra/repo
Commit: http://git-wip-us.apache.org/repos/asf/cassandra/commit/e211008d
Tree: http://git-wip-us.apache.org/repos/asf/cassandra/tree/e211008d
Diff: http://git-wip-us.apache.org/repos/asf/cassandra/diff/e211008d

Branch: refs/heads/trunk
Commit: e211008d5a761e773b8740bdbada21bdcad035c1
Parents: 4129c0b bafcb3a
Author: T Jake Luciani j...@apache.org
Authored: Mon Jun 29 09:53:46 2015 -0400
Committer: T Jake Luciani j...@apache.org
Committed: Mon Jun 29 09:53:46 2015 -0400

--
 build.xml | 5 +
 1 file changed, 1 insertion(+), 4 deletions(-)
--


http://git-wip-us.apache.org/repos/asf/cassandra/blob/e211008d/build.xml
--



[jira] [Commented] (CASSANDRA-9672) Provide a per-table param that would force default ttl on all updates

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605484#comment-14605484
 ] 

Aleksey Yeschenko commented on CASSANDRA-9672:
--

Yeah, that could work as well.

That said, forced single-TTL strictness guarantees us that no overwrite of the 
same cells with a lower TTL ever happens, and that *should* allow us to 
optimize harder.

We probably could/should provide several different options to hint the usage 
patterns, with varying degrees of strictness.

 Provide a per-table param that would force default ttl on all updates
 -

 Key: CASSANDRA-9672
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9672
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Priority: Minor

 Many users have tables that rely on TTL entirely - no deletes, and only fixed 
 TTL value.
 The way that default ttl works now, we only apply it if none is specified.
 We should provide an option that would *enforce* the specified TTL. Not 
 allowing ttl-less {{INSERT}} or {{UPDATE}}, not allowing ttl that's lower or 
 higher than the default ttl, and not allowing deletes.
 That option when enabled ({{force_default_ttl}}) should allow us to drop more 
 tables during compaction and do so cheaper. Would also allow the DBAs to 
 enforce the constraint in a guaranteed manner.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9635) Silent startup failure with filesystem that does not support mmap

2015-06-29 Thread Branimir Lambov (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605418#comment-14605418
 ] 

Branimir Lambov commented on CASSANDRA-9635:


Is the commit log failure policy not working correctly, i.e. does the node fail 
to reach the state it is supposed to after this happens? I believe a deadlock 
waiting for allocation is to be expected as a result of the 'stop' policy.

Perhaps a good simple solution may be for the policy to start as 'die' until we 
have done one allocation.

 Silent startup failure with filesystem that does not support mmap
 -

 Key: CASSANDRA-9635
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9635
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Kevin McLaughlin
Assignee: Stefania
 Fix For: 2.0.x

 Attachments: c_tdump.txt


 C* version 2.0.9.
 When running C* in virtualbox on OS X via boot2docker with the data directory 
 on a shared volume from the host system (vboxfs), C* fails to start without 
 printing any errors.
 I do not know if C* is supposed to support filesystems that do not support 
 mmap (does not appear so), however, I think the failure exposes a static 
 initialization deadlock 
 (http://ternarysearch.blogspot.ru/2013/07/static-initialization-deadlock.html).
 I believe the virtualbox bug is https://www.virtualbox.org/ticket/819.
 Stacktrace of the deadlock is attached.  When placing a t.printStackTrace() 
 between lines 115 and 116 in 
 https://github.com/apache/cassandra/blob/cassandra-2.0.9/src/java/org/apache/cassandra/db/commitlog/CommitLogAllocator.java,
  the stack trace at startup is:
 {quote}
 DEBUG 21:16:54,716 Creating new commit log segment 
 /var/lib/cassandra/commitlog/CommitLog-3-1435007814714.log
 FSWriteError in /var/lib/cassandra/commitlog/CommitLog-3-1435007814714.log
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.init(CommitLogSegment.java:143)
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.freshSegment(CommitLogSegment.java:90)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator.createFreshSegment(CommitLogAllocator.java:263)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator.access$500(CommitLogAllocator.java:50)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator$1.runMayThrow(CommitLogAllocator.java:109)
 at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
 at java.lang.Thread.run(Thread.java:745)
 Caused by: java.io.IOException: Invalid argument
 at sun.nio.ch.FileChannelImpl.map0(Native Method)
 at sun.nio.ch.FileChannelImpl.map(FileChannelImpl.java:893)
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.init(CommitLogSegment.java:133)
 ... 6 more
 {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9635) Silent startup failure with filesystem that does not support mmap

2015-06-29 Thread Stefania (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9635?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605485#comment-14605485
 ] 

Stefania commented on CASSANDRA-9635:
-

Thanks for your input. CommitFailurePolicy in 2.0 has only these values:

{code}
public static enum CommitFailurePolicy
{
stop,
stop_commit,
ignore,
}
{code}

I only tested stop, but I don't think the other ones would be any different. 
When was the die policy introduced, and shall I back-port it? I can also set it 
to 'die' until we have the first segment, on all branches that have it.
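
For illustration, the 'die until the first allocation' idea discussed above 
could look roughly like this. This is a sketch under assumed names, not 
Cassandra's actual commit log code:

```java
/**
 * Sketch (illustrative names only): whatever commit failure policy is
 * configured, behave as 'die' until the first segment allocation has
 * succeeded, so a filesystem that cannot mmap fails loudly at startup
 * instead of hanging silently.
 */
public class CommitLogFailureHandler {
    public enum Policy { die, stop, stop_commit, ignore }

    private final Policy configured;
    private volatile boolean firstSegmentAllocated = false;

    public CommitLogFailureHandler(Policy configured) {
        this.configured = configured;
    }

    /** Called after the first fresh segment is successfully created. */
    public void onSegmentAllocated() {
        firstSegmentAllocated = true;
    }

    /** Policy to apply to a commit log failure right now. */
    public Policy effectivePolicy() {
        return firstSegmentAllocated ? configured : Policy.die;
    }

    public static void main(String[] args) {
        CommitLogFailureHandler h = new CommitLogFailureHandler(Policy.stop);
        System.out.println(h.effectivePolicy());   // die (startup not done)
        h.onSegmentAllocated();
        System.out.println(h.effectivePolicy());   // stop (configured policy)
    }
}
```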

 Silent startup failure with filesystem that does not support mmap
 -

 Key: CASSANDRA-9635
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9635
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Kevin McLaughlin
Assignee: Stefania
 Fix For: 2.0.x

 Attachments: c_tdump.txt


 C* version 2.0.9.
 When running C* in virtualbox on OS X via boot2docker with the data directory 
 on a shared volume from the host system (vboxfs), C* fails to start without 
 printing any errors.
 I do not know if C* is supposed to support filesystems that do not support 
 mmap (does not appear so), however, I think the failure exposes a static 
 initialization deadlock 
 (http://ternarysearch.blogspot.ru/2013/07/static-initialization-deadlock.html).
 I believe the virtualbox bug is https://www.virtualbox.org/ticket/819.
 Stacktrace of the deadlock is attached.  When placing a t.printStackTrace() 
 between lines 115 and 116 in 
 https://github.com/apache/cassandra/blob/cassandra-2.0.9/src/java/org/apache/cassandra/db/commitlog/CommitLogAllocator.java,
  the stack trace at startup is:
 {quote}
 DEBUG 21:16:54,716 Creating new commit log segment 
 /var/lib/cassandra/commitlog/CommitLog-3-1435007814714.log
 FSWriteError in /var/lib/cassandra/commitlog/CommitLog-3-1435007814714.log
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.init(CommitLogSegment.java:143)
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.freshSegment(CommitLogSegment.java:90)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator.createFreshSegment(CommitLogAllocator.java:263)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator.access$500(CommitLogAllocator.java:50)
 at 
 org.apache.cassandra.db.commitlog.CommitLogAllocator$1.runMayThrow(CommitLogAllocator.java:109)
 at org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
 at java.lang.Thread.run(Thread.java:745)
 Caused by: java.io.IOException: Invalid argument
 at sun.nio.ch.FileChannelImpl.map0(Native Method)
 at sun.nio.ch.FileChannelImpl.map(FileChannelImpl.java:893)
 at 
 org.apache.cassandra.db.commitlog.CommitLogSegment.init(CommitLogSegment.java:133)
 ... 6 more
 {quote}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9665) Improve handling of UDF and UDA metadata

2015-06-29 Thread Robert Stupp (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605527#comment-14605527
 ] 

Robert Stupp commented on CASSANDRA-9665:
-

+1 - feel free to commit :)

 Improve handling of UDF and UDA metadata
 

 Key: CASSANDRA-9665
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9665
 Project: Cassandra
  Issue Type: Sub-task
Reporter: Aleksey Yeschenko
Assignee: Aleksey Yeschenko
 Fix For: 3.0 beta 1


 A while ago we decided to make all functions and types keyspace local, but 
 haven't updated our assumption in the code accordingly.
 One consequence is that in addition to {{Schema}} and {{KSMetaData}} we got 
 ourselves a completely separate registry singleton for built-in functions, 
 UDFs, and UDAs - the {{Functions}} class.
 The linked branch makes UDAs and UDFs be a part of {{KSMetaData}}, as they 
 should be, and gets rid of the old {{Functions}} class.
 A new {{Functions}} class is introduced - an immutable container for a given 
 keyspace's functions, and all the definitions are now spread between the 
 keyspaces.
 Additionally, this moves all the built-in functions to {{SystemKeyspace}}. 
 This sneaks in a bit of {{CASSANDRA-9425}}, makes {{CASSANDRA-9367}} easier, 
 and is a minor pre-requisite for a proper implementation of 
 {{CASSANDRA-6717}}.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-9674) Reevaluate size of result/accumulator types of built in sum()+avg() functions

2015-06-29 Thread Robert Stupp (JIRA)
Robert Stupp created CASSANDRA-9674:
---

 Summary: Reevaluate size of result/accumulator types of built in 
sum()+avg() functions
 Key: CASSANDRA-9674
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9674
 Project: Cassandra
  Issue Type: Improvement
Reporter: Robert Stupp
 Fix For: 2.2.x


I'd like to propose enlarging the accumulator and result types. The reason is 
simply that an integer overflow is likely to occur, especially for these 
narrow types. Even the {{sum()}} of just two {{tinyint}} values of {{100}} 
would return {{-56}}, which is just wrong.

Probably like 
[this|http://www.postgresql.org/docs/9.1/static/functions-aggregate.html].

If we decide to do so, we should do it in 2.2.
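
The wrap-around described above is easy to reproduce in Java, whose byte 
corresponds to CQL's tinyint:

```java
/**
 * Demonstrates the narrow-accumulator overflow: Java's byte (CQL tinyint)
 * wraps at 127, so summing two values of 100 in a byte yields -56
 * (200 - 256). A wider accumulator (int/long) avoids this.
 */
public class TinyintSumOverflow {
    public static void main(String[] args) {
        byte a = 100, b = 100;
        byte narrowSum = (byte) (a + b);   // wraps: 200 - 256 = -56
        int wideSum = a + b;               // promoted to int: 200
        System.out.println(narrowSum);     // -56
        System.out.println(wideSum);       // 200
    }
}
```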



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605539#comment-14605539
 ] 

Jonathan Ellis commented on CASSANDRA-9318:
---

* default timeout is 2s not 10, so actually fine in your example of 300MB vs 
150MB/s  x 2s
* but we can easily set a higher limit on MS heap -- maybe as high as 1/8 heap 
as default which gives us a *lot* of room for 8GB heap
* you don't see a complete halt until capacity's worth of requests timeout all 
at once, because you don't get an entire capacity load accepted at once.  it's 
more continuous than discrete -- you pause until the oldest expire, accept 
more, pause until the oldest expire, etc.  so you make slow progress as load 
shedding can free up memory.  thus, load shedding is complementary to flow 
control.
* aggressively load shedding for outlier nodes is a good idea that we should 
follow up on in another ticket.  again, current behavior of continuing to 
accept requests until we fall over is worse than imposing flow control, so we 
should start with that [flow control] in 2.1 and make further improvements in 
2.2.

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Vladimir Kuptsov (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Vladimir Kuptsov updated CASSANDRA-9676:

Description: 
I've the same issue as described in 
https://issues.apache.org/jira/browse/CASSANDRA-9071
As I can understand it happens during the buffer flush, whose size is 
regulated by the withBufferSizeInMB() method call in
{code} 
CQLSSTableWriter
  .builder()
  .inDirectory(createOutputDir())
  .forTable(metadata.schema)
  .using(insertStatement)
  .withBufferSizeInMB(128)
.build()
{code}
For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
processed. On 3Mb buffer it fails after 10 000 lines.

  was:
I've the same issue as described in 
https://issues.apache.org/jira/browse/CASSANDRA-9071
As I can understand it happens during the buffer flush, which regulated with 
the withBufferSizeInMB() method call in
{code} 
CQLSSTableWriter
  .builder()
  .inDirectory(createOutputDir())
  .forTable(metadata.schema)
  .using(insertStatement)
  .withBufferSizeInMB(128)
.build()
{code}
For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
processed. On 3Mb buffer it fails after 10 000 lines.


 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov

 I've the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As I can understand it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code} 
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
 .build()
 {code}
 For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
 processed. On 3Mb buffer it fails after 10 000 lines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8576) Primary Key Pushdown For Hadoop

2015-06-29 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605771#comment-14605771
 ] 

Jeremy Hanna commented on CASSANDRA-8576:
-

Is there anything else that needs to happen on this before committing?

 Primary Key Pushdown For Hadoop
 ---

 Key: CASSANDRA-8576
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8576
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Russell Alexander Spitzer
Assignee: Alex Liu
 Fix For: 2.1.x

 Attachments: 8576-2.1-branch.txt, 8576-trunk.txt, 
 CASSANDRA-8576-v2-2.1-branch.txt, CASSANDRA-8576-v3-2.1-branch.txt


 I've heard reports from several users that they would like to have predicate 
 pushdown functionality for hadoop (Hive in particular) based services. 
 Example usecase
 Table with wide partitions, one per customer
 Application team has HQL they would like to run on a single customer
 Currently time to complete scales with number of customers since Input Format 
 can't pushdown primary key predicate
 Current implementation requires a full table scan (since it can't recognize 
 that a single partition was specified)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-9677) Refactor KSMetaData

2015-06-29 Thread Aleksey Yeschenko (JIRA)
Aleksey Yeschenko created CASSANDRA-9677:


 Summary: Refactor KSMetaData
 Key: CASSANDRA-9677
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9677
 Project: Cassandra
  Issue Type: Improvement
Reporter: Aleksey Yeschenko
Assignee: Aleksey Yeschenko
 Fix For: 3.x


As a part of CASSANDRA-9425, a follow-up to CASSANDRA-9665, and a 
pre-requisite for the new schema change protocol, this ticket will do the 
following:

1. Make {{UTMetaData}} immutable (new {{Types}} class)
2. Refactor handling of the {{CFMetaData}} map in {{KSMetaData}} (new 
{{Tables}} class)
3. Factor out params into a separate class ({{KeyspaceParams}})
4. Rename and move {{KSMetaData}} to {{schema.KeyspaceMetadata}}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9658) Re-enable memory-mapped index file reads on Windows

2015-06-29 Thread Joshua McKenzie (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605763#comment-14605763
 ] 

Joshua McKenzie commented on CASSANDRA-9658:


[~iamaleksey]: My initial assumption is that getting buffered close to parity 
w/mmap on Windows is going to be both much more programmer-hour intensive and 
much more invasive than getting mmap stabilized on Windows in time for 2.2.x 
stabilizing. I agree on the long-term goal of standardizing on a single read 
path; I'll do some stress-testing today to get an initial read on how much pain 
enabling mmap'ed I/O on Windows might cause us.

[~stefania_alborghetti]: I don't think 7066 will actually be necessary for us 
after CASSANDRA-8535 and then CASSANDRA-8984, however I'll need to stress test 
the paths today to get a better feel for it post 8984. Let's sit tight on these 
test results w/mmap on Windows before taking any other steps to try and get 
buffered reads closer to parity right now on account of this ticket.

 Re-enable memory-mapped index file reads on Windows
 ---

 Key: CASSANDRA-9658
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9658
 Project: Cassandra
  Issue Type: Improvement
Reporter: Joshua McKenzie
Assignee: Joshua McKenzie
  Labels: Windows, performance
 Fix For: 2.2.x


 It appears that the impact of buffered vs. memory-mapped index file reads has 
 changed dramatically since last I tested. [Here's some results on various 
 platforms we pulled together yesterday 
 w/2.2-HEAD|https://docs.google.com/spreadsheets/d/1JaO2x7NsK4SSg_ZBqlfH0AwspGgIgFZ9wZ12fC4VZb0/edit#gid=0].
 TL;DR: On linux we see a 40% hit in performance from 108k ops/sec on reads to 
 64.8k ops/sec. While surprising in itself, the really unexpected result (to 
 me) is on Windows - with standard access we're getting 16.8k ops/second on 
 our bare-metal perf boxes vs. 184.7k ops/sec with memory-mapped index files, 
 an over 10-fold increase in throughput. While testing w/standard access, 
 CPUs on the stress machine and C* node are both sitting < 4%, network 
 doesn't appear bottlenecked, resource monitor doesn't show anything 
 interesting, and performance counters in the kernel show very little. Changes 
 in thread count simply serve to increase median latency w/out impacting any 
 other visible metric that we're measuring, so I'm at a loss as to why the 
 disparity is so huge on the platform.
 The combination of my changes to get the 2.1 branch to behave on Windows 
 along with [~benedict] and [~Stefania]'s changes in lifecycle and cleanup 
 patterns on 2.2 should hopefully have us in a state where transitioning back 
 to using memory-mapped I/O on Windows will only cause trouble on snapshot 
 deletion. Fairly simple runs of stress w/compaction aren't popping up any 
 obvious errors on file access or renaming - I'm going to do some much heavier 
 testing (ccm multi-node clusters, long stress w/repair and compaction, etc) 
 and see if there's any outstanding issues that need to be stamped out to call 
 mmap'ed index files on Windows safe. The one thing we'll never be able to 
 support is deletion of snapshots while a node is running and sstables are 
 mapped, but for a > 10x throughput increase I think users would be willing to 
 make that sacrifice.
 The combination of the powercfg profile change, the kernel timer resolution, 
 and memory-mapped index files are giving some pretty interesting performance 
 numbers on EC2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9676:
---
Reproduced In: 2.0.15
Fix Version/s: 2.0.x

 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov
 Fix For: 2.0.x


 I've the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As I can understand it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code} 
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
 .build()
 {code}
 For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
 processed. On 3Mb buffer it fails after 10 000 lines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (CASSANDRA-9064) [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe table statement

2015-06-29 Thread Benjamin Lerer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9064?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer reassigned CASSANDRA-9064:
-

Assignee: Benjamin Lerer  (was: Adam Holmberg)

 [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe 
 table statement
 

 Key: CASSANDRA-9064
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9064
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: cassandra 2.1.3 on mac os x
Reporter: Sujeet Gholap
Assignee: Benjamin Lerer
  Labels: cqlsh
 Fix For: 2.2.0 rc2, 2.1.8


 Here's how to reproduce:
 1) Create a table with LeveledCompactionStrategy
 CREATE keyspace foo WITH REPLICATION = {'class': 'SimpleStrategy', 
 'replication_factor' : 3};
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH compaction = {'class': 'LeveledCompactionStrategy'};
 2) Describe the table and save the output
 cqlsh -e "describe table foo.bar"
 Output should be something like
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH bloom_filter_fp_chance = 0.1
 AND caching = '{keys:ALL, rows_per_partition:NONE}'
 AND comment = ''
 AND compaction = {'min_threshold': '4', 'class': 
 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy', 
 'max_threshold': '32'}
 AND compression = {'sstable_compression': 
 'org.apache.cassandra.io.compress.LZ4Compressor'}
 AND dclocal_read_repair_chance = 0.1
 AND default_time_to_live = 0
 AND gc_grace_seconds = 864000
 AND max_index_interval = 2048
 AND memtable_flush_period_in_ms = 0
 AND min_index_interval = 128
 AND read_repair_chance = 0.0
 AND speculative_retry = '99.0PERCENTILE';
 3) Save the output to repro.cql
 4) Drop the table foo.bar
 cqlsh -e "drop table foo.bar"
 5) Run the create table statement we saved
 cqlsh -f repro.cql
 6) Expected: normal execution without an error
 7) Reality:
 ConfigurationException: ErrorMessage code=2300 [Query invalid because of 
 configuration issue] message=Properties specified [min_threshold, 
 max_threshold] are not understood by LeveledCompactionStrategy
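A plausible manual workaround (an assumption inferred from the error message, not a verified fix) would be to edit repro.cql and drop the min_threshold/max_threshold entries from the compaction map before replaying it:

```sql
-- repro.cql with the sub-options the error complains about removed;
-- the other properties from the DESCRIBE output can be kept as-is.
CREATE TABLE foo.bar (
    spam text PRIMARY KEY
) WITH compaction = {'class': 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'};
```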



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9658) Re-enable memory-mapped index file reads on Windows

2015-06-29 Thread Joshua McKenzie (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605763#comment-14605763
 ] 

Joshua McKenzie edited comment on CASSANDRA-9658 at 6/29/15 3:44 PM:
-

[~iamaleksey]: My initial assumption is that getting buffered close to parity 
w/mmap on Windows is going to be both much more programmer-hour intensive and 
much more invasive than getting mmap stabilized on Windows in time for 2.2.x 
stabilizing. I agree on the long-term goal of standardizing on a single read 
path; I'll do some stress-testing today to get an initial read on how much pain 
enabling mmap'ed I/O on Windows might cause us.

[~stefania_alborghetti]: I don't think 7066 will actually be necessary for us 
(edit: specifically in this context of enabling mmap on Windows w/out access 
violations, not saying 7066 isn't necessary :)) after CASSANDRA-8535 and then 
CASSANDRA-8984, however I'll need to stress test the paths today to get a 
better feel for it post 8984. Let's sit tight on these test results w/mmap on 
Windows before taking any other steps to try and get buffered reads closer to 
parity right now on account of this ticket.


was (Author: joshuamckenzie):
[~iamaleksey]: My initial assumption is that getting buffered close to parity 
w/mmap on Windows is going to be both much more programmer-hour intensive and 
much more invasive than getting mmap stabilized on Windows in time for 2.2.x 
stabilizing. I agree on the long-term goal of standardizing on a single read 
path; I'll do some stress-testing today to get an initial read on how much pain 
enabling mmap'ed I/O on Windows might cause us.

[~stefania_alborghetti]: I don't think 7066 will actually be necessary for us 
after CASSANDRA-8535 and then CASSANDRA-8984, however I'll need to stress test 
the paths today to get a better feel for it post 8984. Let's sit tight on these 
test results w/mmap on Windows before taking any other steps to try and get 
buffered reads closer to parity right now on account of this ticket.

 Re-enable memory-mapped index file reads on Windows
 ---

 Key: CASSANDRA-9658
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9658
 Project: Cassandra
  Issue Type: Improvement
Reporter: Joshua McKenzie
Assignee: Joshua McKenzie
  Labels: Windows, performance
 Fix For: 2.2.x


 It appears that the impact of buffered vs. memory-mapped index file reads has 
 changed dramatically since last I tested. [Here's some results on various 
 platforms we pulled together yesterday 
 w/2.2-HEAD|https://docs.google.com/spreadsheets/d/1JaO2x7NsK4SSg_ZBqlfH0AwspGgIgFZ9wZ12fC4VZb0/edit#gid=0].
 TL;DR: On linux we see a 40% hit in performance from 108k ops/sec on reads to 
 64.8k ops/sec. While surprising in itself, the really unexpected result (to 
 me) is on Windows - with standard access we're getting 16.8k ops/second on 
 our bare-metal perf boxes vs. 184.7k ops/sec with memory-mapped index files, 
 an over 10-fold increase in throughput. While testing w/standard access, 
 CPUs on the stress machine and C* node are both sitting < 4%, network 
 doesn't appear bottlenecked, resource monitor doesn't show anything 
 interesting, and performance counters in the kernel show very little. Changes 
 in thread count simply serve to increase median latency w/out impacting any 
 other visible metric that we're measuring, so I'm at a loss as to why the 
 disparity is so huge on the platform.
 The combination of my changes to get the 2.1 branch to behave on Windows 
 along with [~benedict] and [~Stefania]'s changes in lifecycle and cleanup 
 patterns on 2.2 should hopefully have us in a state where transitioning back 
 to using memory-mapped I/O on Windows will only cause trouble on snapshot 
 deletion. Fairly simple runs of stress w/compaction aren't popping up any 
 obvious errors on file access or renaming - I'm going to do some much heavier 
 testing (ccm multi-node clusters, long stress w/repair and compaction, etc) 
 and see if there's any outstanding issues that need to be stamped out to call 
 mmap'ed index files on Windows safe. The one thing we'll never be able to 
 support is deletion of snapshots while a node is running and sstables are 
 mapped, but for a >10x throughput increase I think users would be willing to 
 make that sacrifice.
 The combination of the powercfg profile change, the kernel timer resolution, 
 and memory-mapped index files are giving some pretty interesting performance 
 numbers on EC2.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9522) Specify unset column ratios in cassandra-stress write

2015-06-29 Thread Jim Witschey (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605785#comment-14605785
 ] 

Jim Witschey commented on CASSANDRA-9522:
-

Last he and I talked, [~tjake] proposed a {{insert_sparseness_distribution}} 
parameter in the stress yaml that would allow you to set sparseness per 
partition with a distribution specifier like {{fixed(50)}} or 
{{uniform(40..60)}}. That'd work for me; is that still a workable change?
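An {{insert_sparseness_distribution}} parameter could be interpreted roughly like this (an illustrative sketch only, not the actual stress implementation; names are hypothetical, and fixed(p) is treated as uniform(p..p)):

```java
import java.util.Random;

// Sketch: sample a per-partition "unset" percentage from a distribution
// specifier like fixed(50) or uniform(40..60), then derive how many of a
// row's columns to actually populate.
class SparsenessSampler {
    private final int minUnsetPct;
    private final int maxUnsetPct;
    private final Random rng;

    SparsenessSampler(int minUnsetPct, int maxUnsetPct, long seed) {
        this.minUnsetPct = minUnsetPct;
        this.maxUnsetPct = maxUnsetPct;
        this.rng = new Random(seed);
    }

    // Number of columns to set out of totalColumns for one row.
    int columnsToSet(int totalColumns) {
        int unsetPct = minUnsetPct + rng.nextInt(maxUnsetPct - minUnsetPct + 1);
        return totalColumns - (totalColumns * unsetPct) / 100;
    }
}
```

With fixed(70) and a 100-column row this yields 30 set values, matching the example in the ticket description.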

 Specify unset column ratios in cassandra-stress write
 -

 Key: CASSANDRA-9522
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9522
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Jim Witschey
Assignee: T Jake Luciani
 Fix For: 3.0 beta 1


 I'd like to be able to use stress to generate workloads with different 
 distributions of unset columns -- so, for instance, you could specify that 
 rows will have 70% unset columns, and on average, a 100-column row would 
 contain only 30 values.
 This would help us test the new row formats introduced in 8099. There are 2 
 different row formats, used depending on the ratio of set to unset columns, 
 and this feature would let us generate workloads that would be stored in each 
 of those formats.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9522) Specify unset column ratios in cassandra-stress write

2015-06-29 Thread T Jake Luciani (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9522?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605791#comment-14605791
 ] 

T Jake Luciani commented on CASSANDRA-9522:
---

Yeah, working on this atm

 Specify unset column ratios in cassandra-stress write
 -

 Key: CASSANDRA-9522
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9522
 Project: Cassandra
  Issue Type: Improvement
  Components: Tools
Reporter: Jim Witschey
Assignee: T Jake Luciani
 Fix For: 3.0 beta 1


 I'd like to be able to use stress to generate workloads with different 
 distributions of unset columns -- so, for instance, you could specify that 
 rows will have 70% unset columns, and on average, a 100-column row would 
 contain only 30 values.
 This would help us test the new row formats introduced in 8099. There are 2 
 different row formats, used depending on the ratio of set to unset columns, 
 and this feature would let us generate workloads that would be stored in each 
 of those formats.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9654) Failure to open sstable after upgrade to trunk

2015-06-29 Thread Branimir Lambov (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9654?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605766#comment-14605766
 ] 

Branimir Lambov commented on CASSANDRA-9654:


The sstables zip you attached is already corrupted (it does specify 
LocalPartitioner as the partitioner for all except one of the sstables). Could 
you zip up the contents at the previous step of the upgrade test?

 Failure to open sstable after upgrade to trunk
 --

 Key: CASSANDRA-9654
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9654
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Philip Thompson
Assignee: Branimir Lambov
 Fix For: 3.x

 Attachments: node1.log, node2.log, node3.log, sstables.tar.gz


 After upgrading a 3 node cluster on the path 1.2.19 -> 2.0.16 -> 2.1-head -> 
 2.2-head -> trunk
 {code}
 ERROR [SSTableBatchOpen:1] 2015-06-25 17:16:55,424 SSTableReader.java:435 - 
 Cannot open /tmp/dtest-Ibi6zm/test/node1/data/upgrade/cf/la-7-big; 
 partitioner org.apache.cassandra.dht.LocalPartitioner does not match system 
 partitioner org.apache.cassandra.dht.Murmur3Partitioner.  Note that the 
 default partitioner starting with Cassandra 1.2 is Murmur3Partitioner, so you 
 will need to edit that to match your old partitioner if upgrading.
 ERROR [SSTableBatchOpen:2] 2015-06-25 17:16:55,425 SSTableReader.java:435 - 
 Cannot open /tmp/dtest-Ibi6zm/test/node1/data/upgrade/cf/la-10-big; 
 partitioner org.apache.cassandra.dht.LocalPartitioner does not match system 
 partitioner org.apache.cassandra.dht.Murmur3Partitioner.  Note that the 
 default partitioner starting with Cassandra 1.2 is Murmur3Partitioner, so you 
 will need to edit that to match your old partitioner if upgrading.
 {code}
 Node logs are attached.
 To reproduce, you'll need to run the upgrade dtest as follows:
 {{nosetests -sv 
 upgrade_through_versions_test.py:TestUpgradeThroughVersions.upgrade_test}}. I 
 don't have CI results for this yet, nor will I soon. To run this, you'll need 
 both JDK7 and JDK8 installed, for compilation reasons. Set the env vars 
 JAVA7_HOME and JAVA8_HOME, respectively. I'll work on finding an sstable that 
 represents the issue.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605779#comment-14605779
 ] 

Benedict commented on CASSANDRA-9318:
-

This still leaves some questions, of varying import and difficulty: 

* how can we easily block consumption of writes from clients without stopping 
reads?
* how should we mandate (or encourage) native clients to behave:
** separate read/write connections?
** configurable blocking queue size for pending writes to avoid unbounded 
growth?
** should timeouts be from time of despatch, or from time of submission to 
local client?
* will we implement this for thrift clients?
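
The high/low watermark scheme described in this ticket can be sketched roughly as follows (a minimal illustration; the class and method names are hypothetical, not part of Cassandra's code):

```java
// Sketch of bounding in-flight request bytes with high/low watermarks.
// When outstanding bytes cross the high watermark, stop reading from
// client connections; resume once they fall back below the low watermark.
class InflightLimiter {
    private final long highWatermark;
    private final long lowWatermark;
    private long outstandingBytes;
    private boolean readsPaused;

    InflightLimiter(long highWatermark, long lowWatermark) {
        this.highWatermark = highWatermark;
        this.lowWatermark = lowWatermark;
    }

    // Called when a request is read off a client connection.
    synchronized void onRequestStart(long bytes) {
        outstandingBytes += bytes;
        if (outstandingBytes >= highWatermark)
            readsPaused = true;   // e.g. clear read interest on the channel
    }

    // Called once the response has been flushed back to the client.
    synchronized void onRequestComplete(long bytes) {
        outstandingBytes -= bytes;
        if (outstandingBytes <= lowWatermark)
            readsPaused = false;  // re-enable reads
    }

    synchronized boolean readsPaused() {
        return readsPaused;
    }
}
```

The gap between the two watermarks provides hysteresis, so reads are not toggled on and off around every single request.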


 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-8576) Primary Key Pushdown For Hadoop

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605806#comment-14605806
 ] 

Jonathan Ellis edited comment on CASSANDRA-8576 at 6/29/15 4:01 PM:


Let's keep 2.1 for bug fixes and commit this to 2.2.  Can I get a 2.2 patch 
from Alex?

(Edit: Piotr +1'd v3 already)


was (Author: jbellis):
Let's keep 2.1 for bug fixes and commit this to 2.2.  Can I get a 2.2 patch 
from Alex?

 Primary Key Pushdown For Hadoop
 ---

 Key: CASSANDRA-8576
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8576
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Russell Alexander Spitzer
Assignee: Alex Liu
 Fix For: 2.2.x

 Attachments: 8576-2.1-branch.txt, 8576-trunk.txt, 
 CASSANDRA-8576-v2-2.1-branch.txt, CASSANDRA-8576-v3-2.1-branch.txt


 I've heard reports from several users that they would like to have predicate 
 pushdown functionality for hadoop (Hive in particular) based services. 
 Example usecase
 Table with wide partitions, one per customer
 Application team has HQL they would like to run on a single customer
 Currently time to complete scales with number of customers since Input Format 
 can't pushdown primary key predicate
 Current implementation requires a full table scan (since it can't recognize 
 that a single partition was specified)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-8576) Primary Key Pushdown For Hadoop

2015-06-29 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-8576:
--
Fix Version/s: (was: 2.1.x)
   2.2.x

Let's keep 2.1 for bug fixes and commit this to 2.2.  Can I get a 2.2 patch 
from Alex?

 Primary Key Pushdown For Hadoop
 ---

 Key: CASSANDRA-8576
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8576
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Russell Alexander Spitzer
Assignee: Alex Liu
 Fix For: 2.2.x

 Attachments: 8576-2.1-branch.txt, 8576-trunk.txt, 
 CASSANDRA-8576-v2-2.1-branch.txt, CASSANDRA-8576-v3-2.1-branch.txt


 I've heard reports from several users that they would like to have predicate 
 pushdown functionality for hadoop (Hive in particular) based services. 
 Example usecase
 Table with wide partitions, one per customer
 Application team has HQL they would like to run on a single customer
 Currently time to complete scales with number of customers since Input Format 
 can't pushdown primary key predicate
 Current implementation requires a full table scan (since it can't recognize 
 that a single partition was specified)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8576) Primary Key Pushdown For Hadoop

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605809#comment-14605809
 ] 

Aleksey Yeschenko commented on CASSANDRA-8576:
--

bq. (Edit: Piotr +1'd v3 already)

Doh, you are right.

 Primary Key Pushdown For Hadoop
 ---

 Key: CASSANDRA-8576
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8576
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Russell Alexander Spitzer
Assignee: Alex Liu
 Fix For: 2.2.x

 Attachments: 8576-2.1-branch.txt, 8576-trunk.txt, 
 CASSANDRA-8576-v2-2.1-branch.txt, CASSANDRA-8576-v3-2.1-branch.txt


 I've heard reports from several users that they would like to have predicate 
 pushdown functionality for hadoop (Hive in particular) based services. 
 Example usecase
 Table with wide partitions, one per customer
 Application team has HQL they would like to run on a single customer
 Currently time to complete scales with number of customers since Input Format 
 can't pushdown primary key predicate
 Current implementation requires a full table scan (since it can't recognize 
 that a single partition was specified)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605814#comment-14605814
 ] 

Benedict commented on CASSANDRA-9318:
-

Do you mean within a single connection? I was under the impression we 
absolutely didn't want to stop readers?

I disagree about not imposing _expectations_ on client implementations, so that 
users don't need to reason about each connector they use independently. Along 
with the protocol spec, other specs such as behaviour in these scenarios should 
really be spelled out. If clients want to do their own thing, there's not much 
we can do, but it's helpful for users to have an expectation of behaviour, and 
for implementors to be made aware of the potential problems their driver may 
encounter that they do not anticipate (and what everyone will expect them to 
do).

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-8005) Server-side DESCRIBE

2015-06-29 Thread Adam Holmberg (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605827#comment-14605827
 ] 

Adam Holmberg commented on CASSANDRA-8005:
--

Revisiting this ticket as we look again at CASSANDRA-6717. We're now faced with 
supporting multiple metadata implementations, *and* generating CQL from them. I 
totally agree that toString is a misfeature on the client side, which is why I 
am strongly in support of this ticket.

I think people expect to be able to DESC and reproduce schema in CQL, 
independent of the metadata implementation.  In the past CQL output has been 
implemented in the driver. If the driver does not provide this, how would we 
reproduce the schema? I think that is the job of the server.

 Server-side DESCRIBE
 

 Key: CASSANDRA-8005
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8005
 Project: Cassandra
  Issue Type: New Feature
  Components: API
Reporter: Tyler Hobbs
Priority: Minor
  Labels: client-impacting, cql3

 The various {{DESCRIBE}} commands are currently implemented by cqlsh, and 
 nearly identical implementations exist in many drivers.  There are several 
 motivations for making {{DESCRIBE}} part of the CQL language:
 * Eliminate the (fairly complex) duplicate implementations across drivers and 
 cqlsh
 * Get closer to allowing drivers to not have to fetch the schema tables. 
 (Minor changes to prepared statements are also needed.)
 * Have instantaneous support for new schema features in cqlsh.  (You 
 currently have to update the bundled python driver.)
 * Support writing out schemas where it makes sense.  One good example of this 
 is backups.  You need to restore the schema before restoring data in the case 
 of total loss, so it makes sense to write out the schema alongside snapshots.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9459) SecondaryIndex API redesign

2015-06-29 Thread Sam Tunnicliffe (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605831#comment-14605831
 ] 

Sam Tunnicliffe commented on CASSANDRA-9459:


At the moment, this is looking eminently doable on top of CASSANDRA-8099. In my 
WIP I have CASSANDRA-9041 working and CASSANDRA-7771 as close to working as 
possible without CASSANDRA-6717's changes to the underlying schema tables. In 
addition, I've reworked the main 2i API to make it primarily (CQL) row based, 
which should be a better fit for most of the known custom 2i implementations 
out there. 

Right now, the read path and both write paths (write time and compaction) are basically 
working and I'm just troubleshooting some existing searcher issues on the main 
8099 branch. Once I'm done with that I'll post a summary of the proposed new 
API for review while I get on with building out the ancillary parts (rebuild 
and so forth) and improving test coverage.

As far as being able to utilise CQL internally in 2i implementations, it's not 
something I've looked at yet but I'm working on dummy index implementations to 
help validate the API, so I can use those to investigate.

 SecondaryIndex API redesign
 ---

 Key: CASSANDRA-9459
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9459
 Project: Cassandra
  Issue Type: Improvement
Reporter: Sam Tunnicliffe
Assignee: Sam Tunnicliffe
 Fix For: 3.0 beta 1


 For some time now the index subsystem has been a pain point and in large part 
 this is due to the way that the APIs and principal classes have grown 
 organically over the years. It would be a good idea to conduct a wholesale 
 review of the area and see if we can come up with something a bit more 
 coherent.
 A few starting points:
 * There's a lot in AbstractPerColumnSecondaryIndex and its subclasses which 
 could be pulled up into SecondaryIndexSearcher (note that to an extent, this 
 is done in CASSANDRA-8099).
 * SecondayIndexManager is overly complex and several of its functions should 
 be simplified/re-examined. The handling of which columns are indexed and 
 index selection on both the read and write paths are somewhat dense and 
 unintuitive.
 * The SecondaryIndex class hierarchy is rather convoluted and could use some 
 serious rework.
 There are a number of outstanding tickets which we should be able to roll 
 into this higher level one as subtasks (but I'll defer doing that until 
 getting into the details of the redesign):
 * CASSANDRA-7771
 * CASSANDRA-8103
 * CASSANDRA-9041
 * CASSANDRA-4458
 * CASSANDRA-8505
 Whilst they're not hard dependencies, I propose that this be done on top of 
 both CASSANDRA-8099 and CASSANDRA-6717. The former largely because the 
 storage engine changes may facilitate a friendlier index API, but also 
 because of the changes to SIS mentioned above. As for 6717, the changes to 
 schema tables there will help facilitate CASSANDRA-7771.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9657) Hint table doing unnecessary compaction

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9657:
---
Fix Version/s: 2.1.x

 Hint table doing unnecessary compaction
 ---

 Key: CASSANDRA-9657
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9657
 Project: Cassandra
  Issue Type: Bug
 Environment: 2.1.7
Reporter: Jan Karlsson
Priority: Minor
 Fix For: 2.1.x


 I found some really strange behaviour. During the replay of a node I found 
 this in the log:
 {code}INFO [CompactionExecutor:7] CompactionTask.java:271 Compacted 1 
 sstables to 
 [/var/lib/cassandra/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-120,].
   452,150,727 bytes to 452,150,727 (~100% of original) in 267,588ms = 
 1.611449MB/s.  1 total partitions merged to 1.  Partition merge counts were 
 {1:1, }{code}
 This happened multiple times until the hint replay was completed and the 
 sstables were removed.
 I tried to replicate this by just starting up a cluster in ccm and killing a 
 node for a few minutes. I got the same behaviour then.
 {code}
 INFO  [CompactionExecutor:2] CompactionTask.java:270 - Compacted 1 sstables 
 to 
 [/home/ejankan/.ccm/hint/node3/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-2,].
   65,570 bytes to 65,570 (~100% of original) in 600ms = 0.104221MB/s.  1 
 total partitions merged to 1.  Partition merge counts were {1:1, }
 {code}
 It seems weird to me that the file does not decrease in size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Resolved] (CASSANDRA-9657) Hint table doing unnecessary compaction

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson resolved CASSANDRA-9657.

Resolution: Not A Problem

[~Jan Karlsson], this will be fixed in 3.0, but unfortunately, for now it is 
expected behavior.

 Hint table doing unnecessary compaction
 ---

 Key: CASSANDRA-9657
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9657
 Project: Cassandra
  Issue Type: Bug
 Environment: 2.1.7
Reporter: Jan Karlsson
Priority: Minor
 Fix For: 2.1.x


 I found some really strange behaviour. During the replay of a node I found 
 this in the log:
 {code}INFO [CompactionExecutor:7] CompactionTask.java:271 Compacted 1 
 sstables to 
 [/var/lib/cassandra/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-120,].
   452,150,727 bytes to 452,150,727 (~100% of original) in 267,588ms = 
 1.611449MB/s.  1 total partitions merged to 1.  Partition merge counts were 
 {1:1, }{code}
 This happened multiple times until the hint replay was completed and the 
 sstables were removed.
 I tried to replicate this by just starting up a cluster in ccm and killing a 
 node for a few minutes. I got the same behaviour then.
 {code}
 INFO  [CompactionExecutor:2] CompactionTask.java:270 - Compacted 1 sstables 
 to 
 [/home/ejankan/.ccm/hint/node3/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-2,].
   65,570 bytes to 65,570 (~100% of original) in 600ms = 0.104221MB/s.  1 
 total partitions merged to 1.  Partition merge counts were {1:1, }
 {code}
 It seems weird to me that the file does not decrease in size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605848#comment-14605848
 ] 

Benedict commented on CASSANDRA-9676:
-

2.0 is EOL

 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov
 Fix For: 2.0.x


 I have the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As far as I can understand, it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code}
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
   .build()
 {code}
 For example, with a 128 MB buffer it fails after 210 000 CSV lines 
 processed; with a 3 MB buffer it fails after 10 000 lines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-9678) help describe is missing the documentation for UDFs

2015-06-29 Thread Peter Halliday (JIRA)
Peter Halliday created CASSANDRA-9678:
-

 Summary: help describe is missing the documentation for UDFs
 Key: CASSANDRA-9678
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9678
 Project: Cassandra
  Issue Type: Improvement
Reporter: Peter Halliday
Priority: Minor


On 2.1, when you type {{help describe}}, the documentation for {{describe types}} is 
missing.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-8005) Server-side DESCRIBE

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605867#comment-14605867
 ] 

Jonathan Ellis edited comment on CASSANDRA-8005 at 6/29/15 4:32 PM:


bq. I believe in Postgres approach. What we need is a (versioned?) virtual 
table set exposing a fixed and documented data dictionary that you could rely 
on, that would map 1-1 to CQL syntax, more or less.

+1.  Server should provide schema in a way humans and clients can meaningfully 
introspect it.  But it is not server's job to reverse engineer that into actual 
CQL strings.


was (Author: jbellis):
bq. I believe in Postgres approach. What we need is a (versioned?) virtual 
table set exposing a fixed and documented data dictionary that you could rely 
on, that would map 1-1 to CQL syntax, more or less.

+1

 Server-side DESCRIBE
 

 Key: CASSANDRA-8005
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8005
 Project: Cassandra
  Issue Type: New Feature
  Components: API
Reporter: Tyler Hobbs
Priority: Minor
  Labels: client-impacting, cql3

 The various {{DESCRIBE}} commands are currently implemented by cqlsh, and 
 nearly identical implementations exist in many drivers.  There are several 
 motivations for making {{DESCRIBE}} part of the CQL language:
 * Eliminate the (fairly complex) duplicate implementations across drivers and 
 cqlsh
 * Get closer to allowing drivers to not have to fetch the schema tables. 
 (Minor changes to prepared statements are also needed.)
 * Have instantaneous support for new schema features in cqlsh.  (You 
 currently have to update the bundled python driver.)
 * Support writing out schemas where it makes sense.  One good example of this 
 is backups.  You need to restore the schema before restoring data in the case 
 of total loss, so it makes sense to write out the schema alongside snapshots.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-8005) Server-side DESCRIBE

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605867#comment-14605867
 ] 

Jonathan Ellis edited comment on CASSANDRA-8005 at 6/29/15 4:32 PM:


bq. I believe in Postgres approach. What we need is a (versioned?) virtual 
table set exposing a fixed and documented data dictionary that you could rely 
on, that would map 1-1 to CQL syntax, more or less.

+1.  Server should provide schema in a way humans and clients can meaningfully 
introspect it.  But it is not server's job to reverse engineer that into actual 
CQL strings.

(Nor, IMO, should it be drivers' jobs.  Time to deprecate that.)


was (Author: jbellis):
bq. I believe in Postgres approach. What we need is a (versioned?) virtual 
table set exposing a fixed and documented data dictionary that you could rely 
on, that would map 1-1 to CQL syntax, more or less.

+1.  Server should provide schema in a way humans and clients can meaningfully 
introspect it.  But it is not server's job to reverse engineer that into actual 
CQL strings.

 Server-side DESCRIBE
 

 Key: CASSANDRA-8005
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8005
 Project: Cassandra
  Issue Type: New Feature
  Components: API
Reporter: Tyler Hobbs
Priority: Minor
  Labels: client-impacting, cql3

 The various {{DESCRIBE}} commands are currently implemented by cqlsh, and 
 nearly identical implementations exist in many drivers.  There are several 
 motivations for making {{DESCRIBE}} part of the CQL language:
 * Eliminate the (fairly complex) duplicate implementations across drivers and 
 cqlsh
 * Get closer to allowing drivers to not have to fetch the schema tables. 
 (Minor changes to prepared statements are also needed.)
 * Have instantaneous support for new schema features in cqlsh.  (You 
 currently have to update the bundled python driver.)
 * Support writing out schemas where it makes sense.  One good example of this 
 is backups.  You need to restore the schema before restoring data in the case 
 of total loss, so it makes sense to write out the schema alongside snapshots.





[jira] [Created] (CASSANDRA-9679) Don't rely on client CQL export for cqlsh DESC

2015-06-29 Thread Adam Holmberg (JIRA)
Adam Holmberg created CASSANDRA-9679:


 Summary: Don't rely on client CQL export for cqlsh DESC
 Key: CASSANDRA-9679
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9679
 Project: Cassandra
  Issue Type: Bug
  Components: Core
Reporter: Adam Holmberg


Client CQL generation will be deprecated. Don't rely on metadata methods 
{{as_cql_query}} or {{export_as_string}} for producing DESC output.

Background in CASSANDRA-8005





[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605808#comment-14605808
 ] 

Jonathan Ellis commented on CASSANDRA-9318:
---

# We can't and we shouldn't
# See Above
# No

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
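The high/low watermark scheme described in the ticket can be sketched as follows. This is a minimal illustration only, not Cassandra code; the class and method names (InFlightTracker, onRequestStart, onRequestComplete) are hypothetical:

```java
import java.util.concurrent.atomic.AtomicLong;

// Hedged sketch: track outstanding request bytes at the coordinator and
// toggle a "reads paused" flag with hysteresis between two watermarks.
class InFlightTracker {
    private final long highWatermarkBytes;
    private final long lowWatermarkBytes;
    private final AtomicLong outstandingBytes = new AtomicLong();
    private volatile boolean readsPaused = false;

    InFlightTracker(long highWatermarkBytes, long lowWatermarkBytes) {
        this.highWatermarkBytes = highWatermarkBytes;
        this.lowWatermarkBytes = lowWatermarkBytes;
    }

    // Called when a request is accepted from a client connection.
    void onRequestStart(long bytes) {
        if (outstandingBytes.addAndGet(bytes) >= highWatermarkBytes)
            readsPaused = true;   // stop reading from client sockets
    }

    // Called when the coordinator finishes (or times out) a request.
    void onRequestComplete(long bytes) {
        if (outstandingBytes.addAndGet(-bytes) <= lowWatermarkBytes)
            readsPaused = false;  // resume reading from client sockets
    }

    boolean readsPaused() { return readsPaused; }
}
```

The gap between the two watermarks provides hysteresis, so reads are not toggled on and off for every request once the cluster sits near the limit.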





[jira] [Commented] (CASSANDRA-9064) [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe table statement

2015-06-29 Thread Benjamin Lerer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605820#comment-14605820
 ] 

Benjamin Lerer commented on CASSANDRA-9064:
---

The patch is 
[here|https://github.com/apache/cassandra/compare/trunk...blerer:9064]

 [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe 
 table statement
 

 Key: CASSANDRA-9064
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9064
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: cassandra 2.1.3 on mac os x
Reporter: Sujeet Gholap
Assignee: Benjamin Lerer
  Labels: cqlsh
 Fix For: 2.2.0 rc2, 2.1.8


 Here's how to reproduce:
 1) Create a table with LeveledCompactionStrategy
 CREATE keyspace foo WITH REPLICATION = {'class': 'SimpleStrategy', 
 'replication_factor' : 3};
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH compaction = {'class': 'LeveledCompactionStrategy'};
 2) Describe the table and save the output
 cqlsh -e "describe table foo.bar"
 Output should be something like
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH bloom_filter_fp_chance = 0.1
 AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
 AND comment = ''
 AND compaction = {'min_threshold': '4', 'class': 
 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy', 
 'max_threshold': '32'}
 AND compression = {'sstable_compression': 
 'org.apache.cassandra.io.compress.LZ4Compressor'}
 AND dclocal_read_repair_chance = 0.1
 AND default_time_to_live = 0
 AND gc_grace_seconds = 864000
 AND max_index_interval = 2048
 AND memtable_flush_period_in_ms = 0
 AND min_index_interval = 128
 AND read_repair_chance = 0.0
 AND speculative_retry = '99.0PERCENTILE';
 3) Save the output to repro.cql
 4) Drop the table foo.bar
 cqlsh -e "drop table foo.bar"
 5) Run the create table statement we saved
 cqlsh -f repro.cql
 6) Expected: normal execution without an error
 7) Reality:
 ConfigurationException: ErrorMessage code=2300 [Query invalid because of 
 configuration issue] message=Properties specified [min_threshold, 
 max_threshold] are not understood by LeveledCompactionStrategy





[jira] [Updated] (CASSANDRA-9660) error while adding a node to existing cluster

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9660:
---
Description: 
Hi,

We are trying to add a fresh node to an existing Cassandra cluster that 
currently has two nodes. While joining, the new node runs for a while and then 
stops after throwing the following exception. I have restarted it several 
times, but it fails every time. 
{code}
ERROR 11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Streaming error 
occurred
java.io.IOException: net.jpountz.lz4.LZ4Exception: Error decoding offset 24114 
of input buffer
at 
org.apache.cassandra.io.compress.LZ4Compressor.uncompress(LZ4Compressor.java:93)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.compress.CompressedInputStream.decompress(CompressedInputStream.java:114)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.compress.CompressedInputStream.read(CompressedInputStream.java:92)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at java.io.InputStream.read(InputStream.java:170) ~[na:1.8.0_45]
at java.io.DataInputStream.readFully(DataInputStream.java:195) 
~[na:1.8.0_45]
at java.io.DataInputStream.readLong(DataInputStream.java:416) 
~[na:1.8.0_45]
at 
org.apache.cassandra.utils.BytesReadTracker.readLong(BytesReadTracker.java:114) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.db.ColumnSerializer.deserializeColumnBody(ColumnSerializer.java:131)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.db.OnDiskAtom$Serializer.deserializeFromSSTable(OnDiskAtom.java:86)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.db.AbstractCell$1.computeNext(AbstractCell.java:52) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.db.AbstractCell$1.computeNext(AbstractCell.java:46) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:143)
 ~[guava-16.0.jar:na]
at 
com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:138) 
~[guava-16.0.jar:na]
at 
org.apache.cassandra.io.sstable.SSTableWriter.appendFromStream(SSTableWriter.java:286)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.StreamReader.writeRow(StreamReader.java:156) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.compress.CompressedStreamReader.read(CompressedStreamReader.java:89)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.messages.IncomingFileMessage$1.deserialize(IncomingFileMessage.java:48)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.messages.IncomingFileMessage$1.deserialize(IncomingFileMessage.java:38)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.messages.StreamMessage.deserialize(StreamMessage.java:55)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.streaming.ConnectionHandler$IncomingMessageHandler.run(ConnectionHandler.java:250)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at java.lang.Thread.run(Thread.java:745) [na:1.8.0_45]
Caused by: net.jpountz.lz4.LZ4Exception: Error decoding offset 24114 of input 
buffer
at 
net.jpountz.lz4.LZ4JNIFastDecompressor.decompress(LZ4JNIFastDecompressor.java:33)
 ~[lz4-1.2.0.jar:na]
at 
org.apache.cassandra.io.compress.LZ4Compressor.uncompress(LZ4Compressor.java:88)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
... 20 common frames omitted
INFO  11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Session with 
/192.168.36.81 is complete
WARN  11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Stream failed
ERROR 11:13:54 Exception encountered during startup
java.lang.RuntimeException: Error during boostrap: Stream failed
at 
org.apache.cassandra.dht.BootStrapper.bootstrap(BootStrapper.java:86) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.StorageService.bootstrap(StorageService.java:1121) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.StorageService.joinTokenRing(StorageService.java:924)
 ~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.StorageService.initServer(StorageService.java:720) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.StorageService.initServer(StorageService.java:602) 
~[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.CassandraDaemon.setup(CassandraDaemon.java:394) 
[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.CassandraDaemon.activate(CassandraDaemon.java:536) 
[apache-cassandra-2.1.6.jar:2.1.6]
at 
org.apache.cassandra.service.CassandraDaemon.main(CassandraDaemon.java:625) 
[apache-cassandra-2.1.6.jar:2.1.6]
Caused by: 

[jira] [Updated] (CASSANDRA-9660) error while adding a node to existing cluster

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9660:
---
Reproduced In: 2.1.6
Fix Version/s: 2.1.x

 error while adding a node to existing cluster
 

 Key: CASSANDRA-9660
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9660
 Project: Cassandra
  Issue Type: Bug
Reporter: pankaj mishra
 Fix For: 2.1.x


 Hi,
 We are trying to add a fresh node to an existing Cassandra cluster that 
 currently has two nodes. While joining, the new node runs for a while and 
 then stops after throwing the following exception. I have restarted it 
 several times, but it fails every time. 
 {code}
 ERROR 11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Streaming error 
 occurred
 java.io.IOException: net.jpountz.lz4.LZ4Exception: Error decoding offset 
 24114 of input buffer
   at 
 org.apache.cassandra.io.compress.LZ4Compressor.uncompress(LZ4Compressor.java:93)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.compress.CompressedInputStream.decompress(CompressedInputStream.java:114)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.compress.CompressedInputStream.read(CompressedInputStream.java:92)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at java.io.InputStream.read(InputStream.java:170) ~[na:1.8.0_45]
   at java.io.DataInputStream.readFully(DataInputStream.java:195) 
 ~[na:1.8.0_45]
   at java.io.DataInputStream.readLong(DataInputStream.java:416) 
 ~[na:1.8.0_45]
   at 
 org.apache.cassandra.utils.BytesReadTracker.readLong(BytesReadTracker.java:114)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.db.ColumnSerializer.deserializeColumnBody(ColumnSerializer.java:131)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.db.OnDiskAtom$Serializer.deserializeFromSSTable(OnDiskAtom.java:86)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.db.AbstractCell$1.computeNext(AbstractCell.java:52) 
 ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.db.AbstractCell$1.computeNext(AbstractCell.java:46) 
 ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:143)
  ~[guava-16.0.jar:na]
   at 
 com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:138) 
 ~[guava-16.0.jar:na]
   at 
 org.apache.cassandra.io.sstable.SSTableWriter.appendFromStream(SSTableWriter.java:286)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.StreamReader.writeRow(StreamReader.java:156) 
 ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.compress.CompressedStreamReader.read(CompressedStreamReader.java:89)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.messages.IncomingFileMessage$1.deserialize(IncomingFileMessage.java:48)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.messages.IncomingFileMessage$1.deserialize(IncomingFileMessage.java:38)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.messages.StreamMessage.deserialize(StreamMessage.java:55)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.streaming.ConnectionHandler$IncomingMessageHandler.run(ConnectionHandler.java:250)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at java.lang.Thread.run(Thread.java:745) [na:1.8.0_45]
 Caused by: net.jpountz.lz4.LZ4Exception: Error decoding offset 24114 of input 
 buffer
   at 
 net.jpountz.lz4.LZ4JNIFastDecompressor.decompress(LZ4JNIFastDecompressor.java:33)
  ~[lz4-1.2.0.jar:na]
   at 
 org.apache.cassandra.io.compress.LZ4Compressor.uncompress(LZ4Compressor.java:88)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   ... 20 common frames omitted
 INFO  11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Session with 
 /192.168.36.81 is complete
 WARN  11:13:54 [Stream #1856b120-1bea-11e5-86e3-4542c13bb31f] Stream failed
 ERROR 11:13:54 Exception encountered during startup
 java.lang.RuntimeException: Error during boostrap: Stream failed
   at 
 org.apache.cassandra.dht.BootStrapper.bootstrap(BootStrapper.java:86) 
 ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.service.StorageService.bootstrap(StorageService.java:1121)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.service.StorageService.joinTokenRing(StorageService.java:924)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.service.StorageService.initServer(StorageService.java:720)
  ~[apache-cassandra-2.1.6.jar:2.1.6]
   at 
 org.apache.cassandra.service.StorageService.initServer(StorageService.java:602)
  

[jira] [Updated] (CASSANDRA-9659) ServerErrorException: java.lang.AssertionError

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9659:
---
Description: 
An issue exists where a java.lang.AssertionError occurs for a select number of 
read queries from Cassandra within our application.

It was suggested that a ticket be created to see if the error below is the same 
as CASSANDRA-8949 which was fixed in version 2.1.5.

Here is a portion of the Cassandra log file where the exception occurs:
{code}
INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
Completed flushing; nothing needed to be retained.  Commitlog position was 
ReplayPosition(segmentId=1425054853780, position=8886361)
ERROR [SharedPool-Worker-1] 2015-06-23 13:11:29,047 Message.java:538 - 
Unexpected exception during request; channel = [id: 0x8f1ca59e, 
/10.30.43.68:33717 => /10.30.43.146:9042]
java.lang.AssertionError: 
[DecoratedKey(5747358200379796162, 
6462346538352d653235382d343130352d616131612d346230396635353965666364),DecoratedKey(3303996443194009861,
 34623632646562322d626234332d346661642d613263312d356334613233633037353932)]
at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:41) 
~[apache-cassandra-2.1.3.jar:2.1.3]
at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:34) 
~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.service.pager.RangeSliceQueryPager.makeIncludingKeyBounds(RangeSliceQueryPager.java:123)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.service.pager.RangeSliceQueryPager.queryNextPage(RangeSliceQueryPager.java:74)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.service.pager.AbstractQueryPager.fetchPage(AbstractQueryPager.java:87)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.service.pager.RangeSliceQueryPager.fetchPage(RangeSliceQueryPager.java:37)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:219)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:62)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.cql3.QueryProcessor.processPrepared(QueryProcessor.java:493)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.transport.messages.ExecuteMessage.execute(ExecuteMessage.java:134)
 ~[apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
 [apache-cassandra-2.1.3.jar:2.1.3]
at 
org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
 [apache-cassandra-2.1.3.jar:2.1.3]
at 
io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
 [netty-all-4.0.23.Final.jar:4.0.23.Final]
at 
io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
 [netty-all-4.0.23.Final.jar:4.0.23.Final]
at 
io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
 [netty-all-4.0.23.Final.jar:4.0.23.Final]
at 
io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
 [netty-all-4.0.23.Final.jar:4.0.23.Final]
at java.util.concurrent.Executors$RunnableAdapter.call(Unknown Source) 
[na:1.7.0_76]
at 
org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
 [apache-cassandra-2.1.3.jar:2.1.3]
at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
[apache-cassandra-2.1.3.jar:2.1.3]
at java.lang.Thread.run(Unknown Source) [na:1.7.0_76]
INFO  [BatchlogTasks:1] 2015-06-23 13:12:17,521 ColumnFamilyStore.java:877 - 
Enqueuing flush of batchlog: 27641 (0%) on-heap, 0 (0%) off-heap
INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,522 Memtable.java:339 - 
Writing Memtable-batchlog@297832842(22529 serialized bytes, 40 ops, 0%/0% of 
on/off-heap limit)
INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,523 Memtable.java:385 - 
Completed flushing; nothing needed to be retained.  Commitlog position was 
ReplayPosition(segmentId=1425054853780, position=8948299)
{code}

  was:
An issue exists where a java.lang.AssertionError occurs for a select number of 
read queries from Cassandra within our application.

It was suggested that a ticket be created to see if the error below is the same 
as CASSANDRA-8949 which was fixed in version 2.1.5.

Here is a portion of the Cassandra log file where the exception occurs:

INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
Completed flushing; nothing needed to 

[jira] [Resolved] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson resolved CASSANDRA-9676.

   Resolution: Duplicate
Fix Version/s: (was: 2.0.x)
   2.1.6

 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov
 Fix For: 2.1.6


 I've the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As far as I can understand, it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code} 
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
 .build()
 {code}
 For example, with a 128 MB buffer it fails after 210,000 CSV lines have been 
 processed; with a 3 MB buffer it fails after 10,000 lines.
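The way the failure point scales with the buffer size can be illustrated with a self-contained sketch. This is not Cassandra's actual CQLSSTableWriter implementation, and BufferedRowWriter is a hypothetical name: rows accumulate in memory and a flush is triggered once the configured size is exceeded, which is the point where the reported error surfaces.

```java
import java.util.ArrayList;
import java.util.List;

// Hedged sketch of the buffering behaviour behind withBufferSizeInMB():
// rows accumulate in memory and are written out as a batch once the
// configured threshold is crossed. BufferedRowWriter is illustrative only.
class BufferedRowWriter {
    private final long bufferBytes;
    private final List<String> buffer = new ArrayList<>();
    private long pendingBytes = 0;
    int flushes = 0;  // exposed so the flush cadence can be observed

    BufferedRowWriter(int bufferSizeInMB) {
        this.bufferBytes = bufferSizeInMB * 1024L * 1024L;
    }

    void addRow(String row) {
        buffer.add(row);
        pendingBytes += row.length();
        if (pendingBytes >= bufferBytes)
            flush();  // in the real writer, this serializes an SSTable
    }

    void flush() {
        buffer.clear();  // a real writer would write the batch to disk here
        pendingBytes = 0;
        flushes++;
    }
}
```

A larger buffer simply means more rows are accepted before the first flush, which is consistent with the failure moving from roughly 10,000 lines at 3 MB to roughly 210,000 lines at 128 MB.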





[jira] [Commented] (CASSANDRA-8005) Server-side DESCRIBE

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605863#comment-14605863
 ] 

Aleksey Yeschenko commented on CASSANDRA-8005:
--

That's where we fundamentally disagree, then.

I believe in Postgres approach. What we need is a (versioned?) virtual table 
set exposing a fixed and documented data dictionary that you could rely on, 
that would map 1-1 to CQL syntax, more or less.

Let's see first how CASSANDRA-6717 and CASSANDRA-8099 pan out.

bq. We're now faced with supporting multiple metadata implementations, and 
generating CQL from them. 

Yes. For now. Long term, 2.2 metadata will be dropped from the drivers and just 
one implementation will remain.

 Server-side DESCRIBE
 

 Key: CASSANDRA-8005
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8005
 Project: Cassandra
  Issue Type: New Feature
  Components: API
Reporter: Tyler Hobbs
Priority: Minor
  Labels: client-impacting, cql3

 The various {{DESCRIBE}} commands are currently implemented by cqlsh, and 
 nearly identical implementations exist in many drivers.  There are several 
 motivations for making {{DESCRIBE}} part of the CQL language:
 * Eliminate the (fairly complex) duplicate implementations across drivers and 
 cqlsh
 * Get closer to allowing drivers to not have to fetch the schema tables. 
 (Minor changes to prepared statements are also needed.)
 * Have instantaneous support for new schema features in cqlsh.  (You 
 currently have to update the bundled python driver.)
 * Support writing out schemas where it makes sense.  One good example of this 
 is backups.  You need to restore the schema before restoring data in the case 
 of total loss, so it makes sense to write out the schema alongside snapshots.





[jira] [Commented] (CASSANDRA-8005) Server-side DESCRIBE

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605867#comment-14605867
 ] 

Jonathan Ellis commented on CASSANDRA-8005:
---

bq. I believe in Postgres approach. What we need is a (versioned?) virtual 
table set exposing a fixed and documented data dictionary that you could rely 
on, that would map 1-1 to CQL syntax, more or less.

+1

 Server-side DESCRIBE
 

 Key: CASSANDRA-8005
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8005
 Project: Cassandra
  Issue Type: New Feature
  Components: API
Reporter: Tyler Hobbs
Priority: Minor
  Labels: client-impacting, cql3

 The various {{DESCRIBE}} commands are currently implemented by cqlsh, and 
 nearly identical implementations exist in many drivers.  There are several 
 motivations for making {{DESCRIBE}} part of the CQL language:
 * Eliminate the (fairly complex) duplicate implementations across drivers and 
 cqlsh
 * Get closer to allowing drivers to not have to fetch the schema tables. 
 (Minor changes to prepared statements are also needed.)
 * Have instantaneous support for new schema features in cqlsh.  (You 
 currently have to update the bundled python driver.)
 * Support writing out schemas where it makes sense.  One good example of this 
 is backups.  You need to restore the schema before restoring data in the case 
 of total loss, so it makes sense to write out the schema alongside snapshots.





[jira] [Assigned] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson reassigned CASSANDRA-9670:
--

Assignee: Philip Thompson  (was: Joshua McKenzie)

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
Assignee: Philip Thompson
 Fix For: 2.1.x

 Attachments: cities.cql


 After installing 2.1.6 and 2.1.7, it is not possible to execute CQL scripts 
 that were previously executed successfully on Windows and Linux environments.
 I have tried installing the latest Python 2 release and executing again, but 
 I get the same error.
 Attaching cities.cql for reference.
 ---
 {code}
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128){code}
 -
 In one of our Ubuntu development environments we see similar errors.
 -
 {code}
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 {code}
 





[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9670:
---
Labels: cqlsh  (was: )

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
Assignee: Philip Thompson
  Labels: cqlsh
 Fix For: 2.1.x

 Attachments: cities.cql


 After installing 2.1.6 and 2.1.7, it is not possible to execute CQL scripts 
 that were previously executed successfully on Windows and Linux environments.
 I have tried installing the latest Python 2 release and executing again, but 
 I get the same error.
 Attaching cities.cql for reference.
 Attaching cities.cql for reference.
 ---
 {code}
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128){code}
 -
 In one of Ubuntu development environment we have similar errors.
 -
 {code}
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 {code}
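The repeated {{'ascii' codec can't decode byte 0xc3}} failures above are characteristic of Python 2 code decoding UTF-8 bytes with the default ASCII codec: 0xc3 is the lead byte of the two-byte UTF-8 sequences used for accented Latin characters, so any non-ASCII city name in the CSV trips it. A minimal reproduction (written for Python 3; the byte value and error wording match what cqlsh's Python reports):

```python
# Reproduce the "'ascii' codec can't decode byte 0xc3" failure seen above.
raw = 'café'.encode('utf-8')        # b'caf\xc3\xa9' -- 0xc3 begins the 'é'

try:
    raw.decode('ascii')             # what an ASCII-only code path does
except UnicodeDecodeError as exc:
    # exc.start is the failing byte offset, the "in position 57" in the logs
    print(exc.reason, 'at position', exc.start)

print(raw.decode('utf-8'))          # decoding the same bytes as UTF-8 works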
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605869#comment-14605869
 ] 

Philip Thompson commented on CASSANDRA-9670:


Are you sure the shopping keyspace exists, in the first example?

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
Assignee: Philip Thompson
  Labels: cqlsh
 Fix For: 2.1.x

 Attachments: cities.cql


 After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
 scripts, which were earlier executed on windows + Linux environment 
 successfully.
 I have tried to install Python 2 latest version and try to execute, but 
 having same error.
 Attaching cities.cql for reference.
 ---
 {code}
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128){code}
 -
 In one of Ubuntu development environment we have similar errors.
 -
 {code}
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 {code}
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9659) ServerErrorException: java.lang.AssertionError

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9659:
---
Assignee: Benjamin Lerer
Priority: Major  (was: Critical)

 ServerErrorException: java.lang.AssertionError
 --

 Key: CASSANDRA-9659
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9659
 Project: Cassandra
  Issue Type: Bug
 Environment: CentOS 6.5
 Cassandra 2.1.3
Reporter: Dave Decicco
Assignee: Benjamin Lerer
 Fix For: 2.1.x


 An issue exists where a java.lang.AssertionError occurs for a select number 
 of read queries from Cassandra within our application.
 It was suggested that a ticket be created to see if the error below is the 
 same as CASSANDRA-8949 which was fixed in version 2.1.5.
 Here is a portion of the Cassandra log file where the exception occurs:
 {code}
 INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  Commitlog position was 
 ReplayPosition(segmentId=1425054853780, position=8886361)
 ERROR [SharedPool-Worker-1] 2015-06-23 13:11:29,047 Message.java:538 - 
 Unexpected exception during request; channel = [id: 0x8f1ca59e, 
 /10.30.43.68:33717 => /10.30.43.146:9042] java.lang.AssertionError: 
 [DecoratedKey(5747358200379796162, 
 6462346538352d653235382d343130352d616131612d346230396635353965666364),DecoratedKey(3303996443194009861,
  34623632646562322d626234332d346661642d613263312d356334613233633037353932)]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:41) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:34) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.makeIncludingKeyBounds(RangeSliceQueryPager.java:123)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.queryNextPage(RangeSliceQueryPager.java:74)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.AbstractQueryPager.fetchPage(AbstractQueryPager.java:87)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.fetchPage(RangeSliceQueryPager.java:37)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:219)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:62)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processPrepared(QueryProcessor.java:493)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.messages.ExecuteMessage.execute(ExecuteMessage.java:134)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at java.util.concurrent.Executors$RunnableAdapter.call(Unknown 
 Source) [na:1.7.0_76]
 at 
 org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
 [apache-cassandra-2.1.3.jar:2.1.3]
 at java.lang.Thread.run(Unknown Source) [na:1.7.0_76]
 INFO  [BatchlogTasks:1] 2015-06-23 13:12:17,521 ColumnFamilyStore.java:877 - 
 Enqueuing flush of batchlog: 27641 (0%) on-heap, 0 (0%) off-heap
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,522 Memtable.java:339 - 
 Writing Memtable-batchlog@297832842(22529 serialized bytes, 40 ops, 0%/0% of 
 on/off-heap limit)
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,523 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  

[jira] [Commented] (CASSANDRA-8576) Primary Key Pushdown For Hadoop

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8576?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605803#comment-14605803
 ] 

Aleksey Yeschenko commented on CASSANDRA-8576:
--

Piotr's approval and +1 as the reviewer.

 Primary Key Pushdown For Hadoop
 ---

 Key: CASSANDRA-8576
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8576
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Russell Alexander Spitzer
Assignee: Alex Liu
 Fix For: 2.1.x

 Attachments: 8576-2.1-branch.txt, 8576-trunk.txt, 
 CASSANDRA-8576-v2-2.1-branch.txt, CASSANDRA-8576-v3-2.1-branch.txt


 I've heard reports from several users that they would like to have predicate 
 pushdown functionality for hadoop (Hive in particular) based services. 
 Example usecase
 Table with wide partitions, one per customer
 Application team has HQL they would like to run on a single customer
 Currently time to complete scales with number of customers since Input Format 
 can't pushdown primary key predicate
 Current implementation requires a full table scan (since it can't recognize 
 that a single partition was specified)



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605817#comment-14605817
 ] 

Jonathan Ellis commented on CASSANDRA-9318:
---

https://issues.apache.org/jira/browse/CASSANDRA-9318?focusedCommentId=14604775&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14604775

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
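The high/low watermark scheme described above could be sketched roughly as follows (a hypothetical Python sketch; the class name, method names, and thresholds are illustrative, not Cassandra's actual implementation):

```python
class InflightLimiter:
    """Hypothetical sketch of the described scheme: stop reading from client
    connections once outstanding request bytes cross a high watermark, and
    resume only after draining below a lower one (the gap between the two
    marks provides hysteresis, so reads don't flap on and off)."""

    def __init__(self, high=64 * 1024 * 1024, low=48 * 1024 * 1024):
        assert low < high
        self.high, self.low = high, low
        self.outstanding = 0          # bytes currently in flight
        self.reads_enabled = True     # whether client channels accept reads

    def on_request_start(self, nbytes):
        self.outstanding += nbytes
        if self.outstanding >= self.high:
            self.reads_enabled = False    # hit high watermark: stop reading

    def on_request_done(self, nbytes):
        self.outstanding -= nbytes
        if self.outstanding <= self.low:
            self.reads_enabled = True     # drained below low mark: resume
```

The open question in the description, whether pausing reads introduces other issues (for example starving responses still pending on the same connection), concerns what "disable read" does at the transport layer and is orthogonal to the accounting shown here.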



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9670:
---
Description: 
After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
scripts, which were earlier executed on windows + Linux environment 
successfully.
I have tried to install Python 2 latest version and try to execute, but having 
same error.
Attaching cities.cql for reference.

---
{code}
cqlsh> source 'shoppoint_setup.cql' ;
shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
message=Keyspace 'shopping' does not exist
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)
cities.cql:14:
Error starting import process:

cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:14:can only join a started process
cities.cql:16:
Error starting import process:

cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:16:can only join a started process
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
Traceback (most recent call last):
  File "<string>", line 1, in <module>
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
ordinal not in range(128)
ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
java.lang.RuntimeException: java.io.FileNotFoundException: 
I:\var\lib\cassandra\data\syste
m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
 (The process cannot access the file because it is being used by another 
process)
ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
java.lang.RuntimeException: java.io.FileNotFoundException: 
I:\var\lib\cassandra\d
ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
 (The process cannot access the file because it is being used by another 
process)
shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
ordinal not in range(128){code}
-

In one of Ubuntu development environment we have similar errors.

-
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)

(corresponding line) COPY cities (city,country_code,state,isactive) FROM 
'testdata/india_cities.csv' ;
[19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 
in position 18: ordinal not in range(128)






  was:
After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
scripts, which were earlier executed on windows + Linux environment 
successfully.
I have tried to install Python 2 latest version and try to execute, but having 
same error.
Attaching cities.cql for reference.

---

cqlsh> source 'shoppoint_setup.cql' ;
shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
message=Keyspace 'shopping' does not exist
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)
cities.cql:14:
Error starting import process:

cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:14:can only join a started process
cities.cql:16:
Error starting import process:

cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:16:can only join a started process
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
Traceback (most recent call last):
  File "<string>", line 1, in <module>
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
  File 

[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9670:
---
Description: 
After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
scripts, which were earlier executed on windows + Linux environment 
successfully.
I have tried to install Python 2 latest version and try to execute, but having 
same error.
Attaching cities.cql for reference.

---
{code}
cqlsh> source 'shoppoint_setup.cql' ;
shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
message=Keyspace 'shopping' does not exist
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)
cities.cql:14:
Error starting import process:

cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:14:can only join a started process
cities.cql:16:
Error starting import process:

cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:16:can only join a started process
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
Traceback (most recent call last):
  File "<string>", line 1, in <module>
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
ordinal not in range(128)
ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
java.lang.RuntimeException: java.io.FileNotFoundException: 
I:\var\lib\cassandra\data\syste
m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
 (The process cannot access the file because it is being used by another 
process)
ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
java.lang.RuntimeException: java.io.FileNotFoundException: 
I:\var\lib\cassandra\d
ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
 (The process cannot access the file because it is being used by another 
process)
shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
ordinal not in range(128){code}
-

In one of Ubuntu development environment we have similar errors.

-
{code}
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)

(corresponding line) COPY cities (city,country_code,state,isactive) FROM 
'testdata/india_cities.csv' ;
[19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 
in position 18: ordinal not in range(128)
{code}





  was:
After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
scripts, which were earlier executed on windows + Linux environment 
successfully.
I have tried to install Python 2 latest version and try to execute, but having 
same error.
Attaching cities.cql for reference.

---
{code}
cqlsh> source 'shoppoint_setup.cql' ;
shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
message=Keyspace 'shopping' does not exist
shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
ordinal not in range(128)
cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
in range(128)
cities.cql:14:
Error starting import process:

cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:14:can only join a started process
cities.cql:16:
Error starting import process:

cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
cities.cql:16:can only join a started process
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
    prepare(preparation_data)
  File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
Traceback (most recent call last):
  File "<string>", line 1, in <module>
    file, path_name, etc = imp.find_module(main_name, dirs)
ImportError: No module named cqlsh
  File 

[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9670:
---
Reproduced In: 2.1.7
Fix Version/s: (was: 2.1.7)
   2.1.x

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
 Fix For: 2.1.x

 Attachments: cities.cql


 After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
 scripts, which were earlier executed on windows + Linux environment 
 successfully.
 I have tried to install Python 2 latest version and try to execute, but 
 having same error.
 Attaching cities.cql for reference.
 ---
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128)
 -
 In one of Ubuntu development environment we have similar errors.
 -
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605825#comment-14605825
 ] 

Benedict commented on CASSANDRA-9318:
-

My mistake, I clearly misread your earlier comment. I'm not sure I agree with 
that conclusion, but not strongly enough to prolong the discussion. So I guess 
that's that then.

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9670:
---
Assignee: Joshua McKenzie

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
Assignee: Joshua McKenzie
 Fix For: 2.1.x

 Attachments: cities.cql


 After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
 scripts, which were earlier executed on windows + Linux environment 
 successfully.
 I have tried to install Python 2 latest version and try to execute, but 
 having same error.
 Attaching cities.cql for reference.
 ---
 {code}
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in main
     prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in prepare
     file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128){code}
 -
 In one of Ubuntu development environment we have similar errors.
 -
 {code}
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 {code}
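
The decode failures above come from cqlsh reading UTF-8 file contents with Python's ASCII codec; 0xc3 is the lead byte of many two-byte UTF-8 sequences (accented Latin letters). A minimal standalone reproduction of the error, independent of cqlsh:

```python
# -*- coding: utf-8 -*-
# Reproduce the "'ascii' codec can't decode byte 0xc3" error from the report.
# 0xc3 is the first byte of two-byte UTF-8 sequences, e.g. u"\u00e9" ("é")
# encodes to b"\xc3\xa9".
data = b"caf\xc3\xa9"  # UTF-8 encoded "café"

try:
    data.decode("ascii")
except UnicodeDecodeError as e:
    # Mirrors the cqlsh message: 'ascii' codec can't decode byte 0xc3 ...
    print(e)

# Decoding with the correct codec succeeds:
print(data.decode("utf-8"))
```

This is why the same scripts fail only on lines containing non-ASCII city names: the bytes are fine, but the codec chosen for the input stream is not.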
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9659) ServerErrorException: java.lang.AssertionError

2015-06-29 Thread Philip Thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605835#comment-14605835
 ] 

Philip Thompson commented on CASSANDRA-9659:


[~JoshuaMcKenzie], does this look like a duplicate of CASSANDRA-8949 to you?

 ServerErrorException: java.lang.AssertionError
 --

 Key: CASSANDRA-9659
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9659
 Project: Cassandra
  Issue Type: Bug
 Environment: CentOS 6.5
 Cassandra 2.1.3
Reporter: Dave Decicco
Priority: Critical

 An issue exists where a java.lang.AssertionError occurs for a select number 
 of read queries from Cassandra within our application.
 It was suggested that a ticket be created to see if the error below is the 
 same as CASSANDRA-8949 which was fixed in version 2.1.5.
 Here is a portion of the Cassandra log file where the exception occurs:
 {code}
 INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  Commitlog position was 
 ReplayPosition(segmentId=1425054853780, position=8886361)
 ERROR [SharedPool-Worker-1] 2015-06-23 13:11:29,047 Message.java:538 - 
 Unexpected exception during request; channel = [id: 0x8f1ca59e, 
 /10.30.43.68:33717 => /10.30.43.146:9042] java.lang.AssertionError: 
 [DecoratedKey(5747358200379796162, 
 6462346538352d653235382d343130352d616131612d346230396635353965666364),DecoratedKey(3303996443194009861,
  34623632646562322d626234332d346661642d613263312d356334613233633037353932)]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:41) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:34) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.makeIncludingKeyBounds(RangeSliceQueryPager.java:123)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.queryNextPage(RangeSliceQueryPager.java:74)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.AbstractQueryPager.fetchPage(AbstractQueryPager.java:87)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.fetchPage(RangeSliceQueryPager.java:37)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:219)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:62)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processPrepared(QueryProcessor.java:493)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.messages.ExecuteMessage.execute(ExecuteMessage.java:134)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at java.util.concurrent.Executors$RunnableAdapter.call(Unknown 
 Source) [na:1.7.0_76]
 at 
 org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
 [apache-cassandra-2.1.3.jar:2.1.3]
 at java.lang.Thread.run(Unknown Source) [na:1.7.0_76]
 INFO  [BatchlogTasks:1] 2015-06-23 13:12:17,521 ColumnFamilyStore.java:877 - 
 Enqueuing flush of batchlog: 27641 (0%) on-heap, 0 (0%) off-heap
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,522 Memtable.java:339 - 
 Writing Memtable-batchlog@297832842(22529 serialized bytes, 40 ops, 0%/0% of 
 on/off-heap limit)
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,523 Memtable.java:385 - 
 Completed flushing; nothing 

[jira] [Commented] (CASSANDRA-9657) Hint table doing unnecessary compaction

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605843#comment-14605843
 ] 

Aleksey Yeschenko commented on CASSANDRA-9657:
--

Yes, unfortunately it is.

 Hint table doing unnecessary compaction
 ---

 Key: CASSANDRA-9657
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9657
 Project: Cassandra
  Issue Type: Bug
 Environment: 2.1.7
Reporter: Jan Karlsson
Priority: Minor
 Fix For: 2.1.x


 I found some really strange behaviour. During the replay of a node I found 
 this in the log:
 {code}INFO [CompactionExecutor:7] CompactionTask.java:271 Compacted 1 
 sstables to 
 [/var/lib/cassandra/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-120,].
   452,150,727 bytes to 452,150,727 (~100% of original) in 267,588ms = 
 1.611449MB/s.  1 total partitions merged to 1.  Partition merge counts were 
 {1:1, }{code}
 This happened multiple times until the hint replay was completed and the 
 sstables were removed.
 I tried to replicate this by just starting up a cluster in ccm and killing a 
 node for a few minutes. I got the same behaviour then.
 {code}
 INFO  [CompactionExecutor:2] CompactionTask.java:270 - Compacted 1 sstables 
 to 
 [/home/ejankan/.ccm/hint/node3/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-2,].
   65,570 bytes to 65,570 (~100% of original) in 600ms = 0.104221MB/s.  1 
 total partitions merged to 1.  Partition merge counts were {1:1, }
 {code}
 It seems weird to me that the file does not decrease in size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Philip Thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605841#comment-14605841
 ] 

Philip Thompson commented on CASSANDRA-9676:


[~benedict], should 9071 have been fixed in 2.0 as well?

 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov
 Fix For: 2.0.x


 I've the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As far as I can understand, it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code} 
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
 .build()
 {code}
 For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
 processed. On 3Mb buffer it fails after 10 000 lines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9676) CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15

2015-06-29 Thread Philip Thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605859#comment-14605859
 ] 

Philip Thompson commented on CASSANDRA-9676:


[~vkuptcov], you'll need to upgrade to 2.1.6 or higher to avoid this bug.

 CQLSSTableWriter gives java.lang.AssertionError: Empty partition in C* 2.0.15
 -

 Key: CASSANDRA-9676
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9676
 Project: Cassandra
  Issue Type: Bug
 Environment: cass 2.0.15
Reporter: Vladimir Kuptsov
 Fix For: 2.1.6


 I've the same issue as described in 
 https://issues.apache.org/jira/browse/CASSANDRA-9071
 As far as I can understand, it happens during the buffer flush, whose size is 
 regulated by the withBufferSizeInMB() method call in
 {code} 
 CQLSSTableWriter
   .builder()
   .inDirectory(createOutputDir())
   .forTable(metadata.schema)
   .using(insertStatement)
   .withBufferSizeInMB(128)
 .build()
 {code}
 For example, when I use 128 Mb buffer, it fails after 210 000 csv lines 
 processed. On 3Mb buffer it fails after 10 000 lines.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (CASSANDRA-9659) ServerErrorException: java.lang.AssertionError

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9659:
---
Reproduced In: 2.1.3
Fix Version/s: 2.1.x

 ServerErrorException: java.lang.AssertionError
 --

 Key: CASSANDRA-9659
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9659
 Project: Cassandra
  Issue Type: Bug
 Environment: CentOS 6.5
 Cassandra 2.1.3
Reporter: Dave Decicco
Priority: Critical
 Fix For: 2.1.x


 An issue exists where a java.lang.AssertionError occurs for a select number 
 of read queries from Cassandra within our application.
 It was suggested that a ticket be created to see if the error below is the 
 same as CASSANDRA-8949 which was fixed in version 2.1.5.
 Here is a portion of the Cassandra log file where the exception occurs:
 {code}
 INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  Commitlog position was 
 ReplayPosition(segmentId=1425054853780, position=8886361)
 ERROR [SharedPool-Worker-1] 2015-06-23 13:11:29,047 Message.java:538 - 
 Unexpected exception during request; channel = [id: 0x8f1ca59e, 
 /10.30.43.68:33717 => /10.30.43.146:9042] java.lang.AssertionError: 
 [DecoratedKey(5747358200379796162, 
 6462346538352d653235382d343130352d616131612d346230396635353965666364),DecoratedKey(3303996443194009861,
  34623632646562322d626234332d346661642d613263312d356334613233633037353932)]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:41) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:34) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.makeIncludingKeyBounds(RangeSliceQueryPager.java:123)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.queryNextPage(RangeSliceQueryPager.java:74)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.AbstractQueryPager.fetchPage(AbstractQueryPager.java:87)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.fetchPage(RangeSliceQueryPager.java:37)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:219)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:62)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processPrepared(QueryProcessor.java:493)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.messages.ExecuteMessage.execute(ExecuteMessage.java:134)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at java.util.concurrent.Executors$RunnableAdapter.call(Unknown 
 Source) [na:1.7.0_76]
 at 
 org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
 [apache-cassandra-2.1.3.jar:2.1.3]
 at java.lang.Thread.run(Unknown Source) [na:1.7.0_76]
 INFO  [BatchlogTasks:1] 2015-06-23 13:12:17,521 ColumnFamilyStore.java:877 - 
 Enqueuing flush of batchlog: 27641 (0%) on-heap, 0 (0%) off-heap
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,522 Memtable.java:339 - 
 Writing Memtable-batchlog@297832842(22529 serialized bytes, 40 ops, 0%/0% of 
 on/off-heap limit)
 INFO  [MemtableFlushWriter:50154] 2015-06-23 13:12:17,523 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  Commitlog position was 
 

[jira] [Created] (CASSANDRA-9680) Update CQL version

2015-06-29 Thread Sylvain Lebresne (JIRA)
Sylvain Lebresne created CASSANDRA-9680:
---

 Summary: Update CQL version
 Key: CASSANDRA-9680
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9680
 Project: Cassandra
  Issue Type: Bug
Reporter: Sylvain Lebresne
Assignee: Tyler Hobbs
 Fix For: 2.2.0 rc2


As far as I can tell, we haven't upgraded the CQL spec version for 2.2. I say 
we should call it CQL 3.3.

[~thobbs] Can you look at upgrading the version in the code and the doc? 
Listing the actual changes in the doc changelog would be awesome too.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605817#comment-14605817
 ] 

Jonathan Ellis edited comment on CASSANDRA-9318 at 6/29/15 4:11 PM:


https://issues.apache.org/jira/browse/CASSANDRA-9318?focusedCommentId=14604775&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14604775

bq. [It is not] a good idea to try to allow extra reads when write capacity is 
full or vice versa. They both ultimately use the same resources (cpu, heap, 
disk i/o).


was (Author: jbellis):
https://issues.apache.org/jira/browse/CASSANDRA-9318?focusedCommentId=14604775&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14604775

 [It is not] a good idea to try to allow extra reads when write capacity is 
 full or vice versa. They both ultimately use the same resources (cpu, heap, 
 disk i/o).

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
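
The high/low watermark scheme described above can be sketched as a small tracker; {{InflightTracker}} and its thresholds are hypothetical names for illustration, not Cassandra's actual implementation:

```python
# Hypothetical sketch of the watermark scheme described in this ticket: track
# in-flight request bytes, stop reading from client connections at the high
# watermark, and resume only once we drain below the low watermark. The gap
# between the two thresholds (hysteresis) avoids rapid enable/disable toggling.
class InflightTracker:
    def __init__(self, low_watermark, high_watermark):
        assert low_watermark < high_watermark
        self.low = low_watermark
        self.high = high_watermark
        self.inflight_bytes = 0
        self.reads_enabled = True

    def on_request_start(self, nbytes):
        self.inflight_bytes += nbytes
        if self.inflight_bytes >= self.high:
            self.reads_enabled = False  # would disable channel auto-read here

    def on_request_done(self, nbytes):
        self.inflight_bytes -= nbytes
        if self.inflight_bytes <= self.low:
            self.reads_enabled = True   # re-enable reads once drained
```

In a real implementation the {{reads_enabled}} flag would map to toggling read interest on the client connections, which is exactly the part the ticket flags as needing care.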



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605817#comment-14605817
 ] 

Jonathan Ellis edited comment on CASSANDRA-9318 at 6/29/15 4:10 PM:


https://issues.apache.org/jira/browse/CASSANDRA-9318?focusedCommentId=14604775&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14604775

 [It is not] a good idea to try to allow extra reads when write capacity is 
 full or vice versa. They both ultimately use the same resources (cpu, heap, 
 disk i/o).


was (Author: jbellis):
https://issues.apache.org/jira/browse/CASSANDRA-9318?focusedCommentId=14604775&page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14604775

 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9657) Hint table doing unnecessary compaction

2015-06-29 Thread Philip Thompson (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9657?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605836#comment-14605836
 ] 

Philip Thompson commented on CASSANDRA-9657:


[~iamaleksey], is this expected behavior?

 Hint table doing unnecessary compaction
 ---

 Key: CASSANDRA-9657
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9657
 Project: Cassandra
  Issue Type: Bug
 Environment: 2.1.7
Reporter: Jan Karlsson
Priority: Minor
 Fix For: 2.1.x


 I found some really strange behaviour. During the replay of a node I found 
 this in the log:
 {code}INFO [CompactionExecutor:7] CompactionTask.java:271 Compacted 1 
 sstables to 
 [/var/lib/cassandra/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-120,].
   452,150,727 bytes to 452,150,727 (~100% of original) in 267,588ms = 
 1.611449MB/s.  1 total partitions merged to 1.  Partition merge counts were 
 {1:1, }{code}
 This happened multiple times until the hint replay was completed and the 
 sstables were removed.
 I tried to replicate this by just starting up a cluster in ccm and killing a 
 node for a few minutes. I got the same behaviour then.
 {code}
 INFO  [CompactionExecutor:2] CompactionTask.java:270 - Compacted 1 sstables 
 to 
 [/home/ejankan/.ccm/hint/node3/data/system/hints-2666e20573ef38b390fefecf96e8f0c7/system-hints-ka-2,].
   65,570 bytes to 65,570 (~100% of original) in 600ms = 0.104221MB/s.  1 
 total partitions merged to 1.  Partition merge counts were {1:1, }
 {code}
 It seems weird to me that the file does not decrease in size.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9659) ServerErrorException: java.lang.AssertionError

2015-06-29 Thread Joshua McKenzie (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9659?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605872#comment-14605872
 ] 

Joshua McKenzie commented on CASSANDRA-9659:


Don't think so - 8949 would have been a silent drop if someone misused the API 
internally. This looks like an incorrect bound being created during a select 
statement creation as it's failing on the assertion in the constructor:
{noformat}
public Bounds(T left, T right, IPartitioner partitioner)
{
    super(left, right, partitioner);
    // unlike a Range, a Bounds may not wrap
    assert left.compareTo(right) <= 0 || right.isMinimum(partitioner) : "[" + left + "," + right + "]";
}
{noformat}
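
The invariant in that constructor (left must not sort after right, unless right is the ring minimum) can be mirrored in a small sketch; the {{Bounds}} class below is illustrative Python, not the Java code:

```python
# Illustrative mirror of the Bounds invariant quoted above: a Bounds is a
# closed interval [left, right] that may not wrap, so it requires
# left <= right (unless right is the ring minimum). Swapped keys -- as in the
# AssertionError from the log, where the left token sorts after the right
# one -- violate it. This is a sketch, not the actual Java class.
class Bounds:
    def __init__(self, left, right, right_is_minimum=False):
        if not (left <= right or right_is_minimum):
            raise AssertionError("[%r,%r]" % (left, right))
        self.left, self.right = left, right

# Ordered tokens construct fine; passing them in the reversed order (as the
# pager apparently did in the log above) would raise the AssertionError.
ok = Bounds(3303996443194009861, 5747358200379796162)
```

The log shows the pager building a Bounds whose left token (5747358200379796162) sorts after its right token (3303996443194009861), which is exactly the reversed-order case.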

 ServerErrorException: java.lang.AssertionError
 --

 Key: CASSANDRA-9659
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9659
 Project: Cassandra
  Issue Type: Bug
 Environment: CentOS 6.5
 Cassandra 2.1.3
Reporter: Dave Decicco
Priority: Critical

 An issue exists where a java.lang.AssertionError occurs for a select number 
 of read queries from Cassandra within our application.
 It was suggested that a ticket be created to see if the error below is the 
 same as CASSANDRA-8949 which was fixed in version 2.1.5.
 Here is a portion of the Cassandra log file where the exception occurs:
 {code}
 INFO  [MemtableFlushWriter:50153] 2015-06-23 13:11:17,517 Memtable.java:385 - 
 Completed flushing; nothing needed to be retained.  Commitlog position was 
 ReplayPosition(segmentId=1425054853780, position=8886361)
 ERROR [SharedPool-Worker-1] 2015-06-23 13:11:29,047 Message.java:538 - 
 Unexpected exception during request; channel = [id: 0x8f1ca59e, 
 /10.30.43.68:33717 => /10.30.43.146:9042] java.lang.AssertionError: 
 [DecoratedKey(5747358200379796162, 
 6462346538352d653235382d343130352d616131612d346230396635353965666364),DecoratedKey(3303996443194009861,
  34623632646562322d626234332d346661642d613263312d356334613233633037353932)]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:41) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.dht.Bounds.<init>(Bounds.java:34) 
 ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.makeIncludingKeyBounds(RangeSliceQueryPager.java:123)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.queryNextPage(RangeSliceQueryPager.java:74)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.AbstractQueryPager.fetchPage(AbstractQueryPager.java:87)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.service.pager.RangeSliceQueryPager.fetchPage(RangeSliceQueryPager.java:37)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:219)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.statements.SelectStatement.execute(SelectStatement.java:62)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processStatement(QueryProcessor.java:238)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.cql3.QueryProcessor.processPrepared(QueryProcessor.java:493)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.messages.ExecuteMessage.execute(ExecuteMessage.java:134)
  ~[apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:439)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 org.apache.cassandra.transport.Message$Dispatcher.channelRead0(Message.java:335)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at 
 io.netty.channel.SimpleChannelInboundHandler.channelRead(SimpleChannelInboundHandler.java:105)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.invokeChannelRead(AbstractChannelHandlerContext.java:333)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext.access$700(AbstractChannelHandlerContext.java:32)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at 
 io.netty.channel.AbstractChannelHandlerContext$8.run(AbstractChannelHandlerContext.java:324)
  [netty-all-4.0.23.Final.jar:4.0.23.Final]
 at java.util.concurrent.Executors$RunnableAdapter.call(Unknown 
 Source) [na:1.7.0_76]
 at 
 org.apache.cassandra.concurrent.AbstractTracingAwareExecutorService$FutureTask.run(AbstractTracingAwareExecutorService.java:164)
  [apache-cassandra-2.1.3.jar:2.1.3]
 at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) 
 [apache-cassandra-2.1.3.jar:2.1.3]
 at java.lang.Thread.run(Unknown Source) 

[jira] [Commented] (CASSANDRA-8099) Refactor and modernize the storage engine

2015-06-29 Thread Sylvain Lebresne (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-8099?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605893#comment-14605893
 ] 

Sylvain Lebresne commented on CASSANDRA-8099:
-

I've force-pushed a rebased version of the branch (still 
[here|https://github.com/pcmanus/cassandra/tree/8099]). Since my last update, 
on top of a number of fixes, I've finished moving the {{OpOrder}} out of the 
iterators' close, and I've updated the range tombstone code to use specific 
boundary markers as discussed above (I've also included Branimir's branch with 
its nits and fixed most of the others). I haven't had time to upgrade 
Branimir's test, however, so for the sake of compilation I've currently 
removed it. If you could have a look at rebasing your test, [~blambov], that 
would be greatly appreciated, as you're more familiar with it.

There is still a fair amount of work to be done on this ticket, but the bulk 
of it is reasonably stable, and outside of some of the backward compatibility 
code the branch is generally functional. We're also starting to have tickets 
that are based on this and are ready (or almost are), tickets that won't be 
impacted too much by the remaining parts of this (which include the 
refactoring of the flyweight-based implementation that I'm going to focus on 
now, the wire backward compatibility code Tyler is working on, and some 
general testing/bug fixing).

So, based on some offline discussion, I suggest committing the current branch 
to trunk. I won't close this ticket just yet and will continue fixing the 
remaining things, but this will allow other tickets to synchronize on this one 
and will generally help get more eyes on it by necessity.

And I'm planning to commit this tomorrow-ish (my European tomorrow), so if you 
have a strong objection to this (again, we're not closing the ticket, and 
committing it doesn't mean it can't change), please speak up quickly.


 Refactor and modernize the storage engine
 -

 Key: CASSANDRA-8099
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8099
 Project: Cassandra
  Issue Type: Improvement
Reporter: Sylvain Lebresne
Assignee: Sylvain Lebresne
 Fix For: 3.0 beta 1

 Attachments: 8099-nit


 The current storage engine (which for this ticket I'll loosely define as the 
 code implementing the read/write path) is suffering from old age. One of the 
 main problem is that the only structure it deals with is the cell, which 
 completely ignores the more high level CQL structure that groups cell into 
 (CQL) rows.
 This leads to many inefficiencies, like the fact that during a read we have 
 to group cells multiple times (to count on the replica, then to count on the 
 coordinator, then to produce the CQL result set) because we forget about the 
 grouping right away each time (so lots of useless cell name comparisons in 
 particular). But beyond inefficiencies, having to manually recreate the CQL 
 structure every time we need it for something is hindering new features and 
 makes the code more complex than it should be.
 Said storage engine also has tons of technical debt. To pick an example, the 
 fact that during range queries we update {{SliceQueryFilter.count}} is pretty 
 hacky and error prone. Or the overly complex ways {{AbstractQueryPager}} has 
 to go into to simply remove the last query result.
 So I want to bite the bullet and modernize this storage engine. I propose to 
 do 2 main things:
 # Make the storage engine more aware of the CQL structure. In practice, 
 instead of having partitions be a simple iterable map of cells, it should be 
 an iterable list of row (each being itself composed of per-column cells, 
 though obviously not exactly the same kind of cell we have today).
 # Make the engine more iterative. What I mean here is that in the read path, 
 we end up reading all cells in memory (we put them in a ColumnFamily object), 
 but there is really no reason to. If instead we were working with iterators 
 all the way through, we could get to a point where we're basically 
 transferring data from disk to the network, and we should be able to reduce 
 GC substantially.
 Please note that such a refactor should provide some performance improvements 
 right off the bat, but that's not its primary goal. Its primary goal is to 
 simplify the storage engine and add abstractions that are better suited to 
 further optimizations.
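
Point #2 above (iterators all the way through) can be sketched by contrasting a materializing read with a streaming one; all names below are illustrative, not Cassandra code:

```python
# Illustrative sketch of point #2: instead of materializing every cell into
# one big in-memory object (the ColumnFamily approach), chain generators so
# rows stream from "disk" to the "network" one at a time, keeping memory use
# flat regardless of partition size. Names and schema are made up.
def read_rows(sstable):
    for row in sstable:               # pretend this lazily reads from disk
        yield row

def filter_live(rows, now):
    for row in rows:
        if row["ttl_expiry"] > now:   # drop expired rows as they stream by
            yield row

def materializing_read(sstable, now):
    # old style: the whole result lives in memory at once
    return [r for r in sstable if r["ttl_expiry"] > now]

# Both paths produce the same rows; only the streaming one avoids holding
# the full partition in memory.
sstable = [{"k": i, "ttl_expiry": 100 + i} for i in range(5)]
streamed = list(filter_live(read_rows(sstable), now=102))
assert streamed == materializing_read(sstable, now=102)
```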



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (CASSANDRA-9670) Cannot run CQL scripts on Windows AND having error Ubuntu Linux

2015-06-29 Thread Sanjay Patel (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9670?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14605922#comment-14605922
 ] 

Sanjay Patel commented on CASSANDRA-9670:
-

Yes, it does exist. In the main script I am dropping and creating it again. 
All other tables are created successfully, but we are having a problem with 
the data import. The same scripts run successfully in all environments and 
servers with 2.0.7.

 Cannot run CQL scripts on Windows AND having error Ubuntu Linux
 ---

 Key: CASSANDRA-9670
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9670
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: DataStax Community Edition 
 on Windows 7, 64 Bit and Ubuntu 
Reporter: Sanjay Patel
Assignee: Philip Thompson
  Labels: cqlsh
 Fix For: 2.1.x

 Attachments: cities.cql


 After installation of 2.1.6 and 2.1.7 it is not possible to execute cql 
 scripts, which were earlier executed on windows + Linux environment 
 successfully.
 I have tried to install Python 2 latest version and try to execute, but 
 having same error.
 Attaching cities.cql for reference.
 ---
 {code}
 cqlsh> source 'shoppoint_setup.cql' ;
 shoppoint_setup.cql:16:InvalidRequest: code=2200 [Invalid query] 
 message=Keyspace 'shopping' does not exist
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 cities.cql:14:
 Error starting import process:
 cities.cql:14:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:14:can only join a started process
 cities.cql:16:
 Error starting import process:
 cities.cql:16:Can't pickle <type 'thread.lock'>: it's not found as thread.lock
 cities.cql:16:can only join a started process
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 Traceback (most recent call last):
   File "<string>", line 1, in <module>
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 380, in 
 main
 prepare(preparation_data)
   File "I:\programm\python2710\lib\multiprocessing\forking.py", line 489, in 
 prepare
 file, path_name, etc = imp.find_module(main_name, dirs)
 ImportError: No module named cqlsh
 shoppoint_setup.cql:663:'ascii' codec can't decode byte 0xc3 in position 18: 
 ordinal not in range(128)
 ipcache.cql:28:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\data\syste
 m\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-ka-300-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 ccavn_bulkupdate.cql:75:ServerError: ErrorMessage code= [Server error] 
 message=java.lang.RuntimeException: java.util.concurrent.ExecutionException: 
 java.lang.RuntimeException: java.io.FileNotFoundException: 
 I:\var\lib\cassandra\d
 ata\system\schema_columns-296e9c049bec3085827dc17d3df2122a\system-schema_columns-tmplink-ka-339-Data.db
  (The process cannot access the file because it is being used by another 
 process)
 shoppoint_setup.cql:680:'ascii' codec can't decode byte 0xe2 in position 14: 
 ordinal not in range(128){code}
 -
 In one of our Ubuntu development environments we see similar errors.
 -
 {code}
 shoppoint_setup.cql:647:'ascii' codec can't decode byte 0xc3 in position 57: 
 ordinal not in range(128)
 cities.cql:9:'ascii' codec can't decode byte 0xc3 in position 51: ordinal not 
 in range(128)
 (corresponding line) COPY cities (city,country_code,state,isactive) FROM 
 'testdata/india_cities.csv' ;
 [19:53:18] j.basu: shoppoint_setup.cql:663:'ascii' codec can't decode byte 
 0xc3 in position 18: ordinal not in range(128)
 {code}
 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread mlowicki (JIRA)
mlowicki created CASSANDRA-9681:
---

 Summary: Memtable heap size grows and many long GC pauses are 
triggered
 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Priority: Critical


C* 2.1.7 cluster is behaving really badly after 1-2 days. 
{{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
 jumps to 7 GB 
(https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
 on 3/6 nodes in each data center and then there are many long GC pauses. 
The cluster is using the default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}}).

Before C* 2.1.5 the memtables heap size was basically constant at ~500MB 
(https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)

After restarting all nodes it behaves stably for 1-2 days. Today I've done that 
and the long GC pauses are gone (~18:00 
https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
 The only pattern we've found so far is that long GC pauses happen 
basically at the same time on all nodes in the same data center - even on the 
ones where the memtables heap size is not growing.





[jira] [Updated] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread mlowicki (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

mlowicki updated CASSANDRA-9681:

Description: 
C* 2.1.7 cluster is behaving really badly after 1-2 days. 
{{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
 jumps to 7 GB 
(https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
 on 3/6 nodes in each data center and then there are many long GC pauses. 
The cluster is using the default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}}).

Before C* 2.1.5 the memtables heap size was basically constant at ~500MB 
(https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)

After restarting all nodes it behaves stably for 1-2 days. Today I've done that 
and the long GC pauses are gone (~18:00 
https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
 The only pattern we've found so far is that long GC pauses happen 
basically at the same time on all nodes in the same data center - even on the 
ones where the memtables heap size is not growing.

Cliffs on the graphs are node restarts.

  was:
C* 2.1.7 cluster is behaving really bad after 1-2 days. 
{{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
 jumps to 7 GB 
(https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
 on 3/6 nodes in each data center and then there are many long GC pauses. 
Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})

Before C* 2.1.5 memtables heap size was basically constant ~500MB 
(https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)

After restarting all nodes is behaves stable for 1-2days. Today I've done that 
and long GC pauses are gone (~18:00 
https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
 The only pattern we've found so far is that long GC  pauses are happening 
basically at the same time on all nodes in the same data center - even on the 
ones where memtables heap size is not growing.


 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Priority: Critical

 C* 2.1.7 cluster is behaving really bad after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})
 Before C* 2.1.5 memtables heap size was basically constant ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes is behaves stable for 1-2days. Today I've done 
 that and long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC  pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.





[jira] [Updated] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread mlowicki (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

mlowicki updated CASSANDRA-9681:

Description: 
C* 2.1.7 cluster is behaving really badly after 1-2 days. 
{{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
 jumps to 7 GB 
(https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
 on 3/6 nodes in each data center and then there are many long GC pauses. 
The cluster is using the default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}}).

Before C* 2.1.5 the memtables heap size was basically constant at ~500MB 
(https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)

After restarting all nodes it behaves stably for 1-2 days. Today I've done that 
and the long GC pauses are gone (~18:00 
https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
 The only pattern we've found so far is that long GC pauses happen 
basically at the same time on all nodes in the same data center - even on the 
ones where the memtables heap size is not growing.

Cliffs on the graphs are node restarts.

Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
level - 
https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.

  was:
C* 2.1.7 cluster is behaving really bad after 1-2 days. 
{{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
 jumps to 7 GB 
(https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
 on 3/6 nodes in each data center and then there are many long GC pauses. 
Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})

Before C* 2.1.5 memtables heap size was basically constant ~500MB 
(https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)

After restarting all nodes is behaves stable for 1-2days. Today I've done that 
and long GC pauses are gone (~18:00 
https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
 The only pattern we've found so far is that long GC  pauses are happening 
basically at the same time on all nodes in the same data center - even on the 
ones where memtables heap size is not growing.

Cliffs on the graphs are nodes restarts.


 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Priority: Critical

 C* 2.1.7 cluster is behaving really bad after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})
 Before C* 2.1.5 memtables heap size was basically constant ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes is behaves stable for 1-2days. Today I've done 
 that and long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC  pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.
 Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
 level - 
 https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.





[jira] [Updated] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9681:
---
Reproduced In: 2.1.7
Fix Version/s: 2.1.x

 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Priority: Critical
 Fix For: 2.1.x


 C* 2.1.7 cluster is behaving really bad after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})
 Before C* 2.1.5 memtables heap size was basically constant ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes is behaves stable for 1-2days. Today I've done 
 that and long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC  pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.
 Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
 level - 
 https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.





[jira] [Commented] (CASSANDRA-9678) help describe is missing the documentation for UDFs

2015-06-29 Thread Aleksey Yeschenko (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9678?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606624#comment-14606624
 ] 

Aleksey Yeschenko commented on CASSANDRA-9678:
--

Are you sure you mean UDFs, and not UDTs? UDFs are new to 2.2.

 help describe is missing the documentation for UDFs
 ---

 Key: CASSANDRA-9678
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9678
 Project: Cassandra
  Issue Type: Improvement
Reporter: Peter Halliday
Priority: Minor
  Labels: cqlsh
 Fix For: 2.1.x


 On 2.1 when you type help describe, the documentation for describe types is 
 missing.





[jira] [Updated] (CASSANDRA-8630) Faster sequential IO (on compaction, streaming, etc)

2015-06-29 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-8630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis updated CASSANDRA-8630:
--
Assignee: Stefania  (was: Oleg Anastasyev)

 Faster sequential IO (on compaction, streaming, etc)
 

 Key: CASSANDRA-8630
 URL: https://issues.apache.org/jira/browse/CASSANDRA-8630
 Project: Cassandra
  Issue Type: Improvement
  Components: Core, Tools
Reporter: Oleg Anastasyev
Assignee: Stefania
  Labels: compaction, performance
 Fix For: 3.x

 Attachments: 8630-FasterSequencialReadsAndWrites.txt, cpu_load.png


 When a node is doing a lot of sequential IO (streaming, compacting, etc.) a lot 
 of CPU is lost in calls to RAF's int read() and DataOutputStream's write(int).
 This is because the default implementations of readShort, readLong, etc., as 
 well as their matching write* methods, are built from numerous byte-by-byte 
 reads and writes. 
 This makes a lot of syscalls as well.
 A quick microbench shows that just reimplementing these methods gives an 8x 
 speed increase.
 The attached patch implements RandomAccessReader.readType and 
 SequentialWriter.writeType methods in a more efficient way.
 I also eliminated some extra byte copies in CompositeType.split and 
 ColumnNameHelper.maxComponents, which were on my profiler's hotspot method 
 list during tests.
 Stress tests on my laptop show that this patch makes compaction 25-30% 
 faster on uncompressed sstables and 15% faster for compressed ones.
 A deployment to production shows much less CPU load for compaction. 
 (I attached a cpu load graph from one of our production clusters; orange is 
 niced CPU load - i.e. compaction; yellow is user - i.e. tasks not related to 
 compaction.)
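The byte-by-byte cost described above is easy to illustrate. The following is a rough Python stand-in, not the actual Java patch: the first function assembles a long from eight single-byte reads, as DataInput's default readLong effectively does, while the second does one bulk read and a single decode.

```python
import struct

# Rough Python stand-in for the optimization described above; this is NOT the
# actual Java patch. read_long_bytewise mirrors DataInput's default readLong
# (eight single-byte reads, shifted and OR'd together), while read_long_bulk
# does one 8-byte read and a single decode. Values are treated as unsigned
# here; Java's readLong is signed.
def read_long_bytewise(stream):
    value = 0
    for _ in range(8):
        value = (value << 8) | stream.read(1)[0]  # one call per byte
    return value

def read_long_bulk(stream):
    return struct.unpack(">Q", stream.read(8))[0]  # one call for all 8 bytes
```

Timing the two over a large buffer shows the per-call overhead the patch removes; the Java version additionally saves syscalls when the reads reach the OS.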





[jira] [Commented] (CASSANDRA-7392) Abort in-progress queries that time out

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-7392?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606685#comment-14606685
 ] 

Jonathan Ellis commented on CASSANDRA-7392:
---

I'm just envisioning worker-thread checks here to drop a request in progress.  
I'm fine with the coordinator timing out normally; no need for extra work in 
StorageProxy as a first step.

 Abort in-progress queries that time out
 ---

 Key: CASSANDRA-7392
 URL: https://issues.apache.org/jira/browse/CASSANDRA-7392
 Project: Cassandra
  Issue Type: New Feature
  Components: Core
Reporter: Jonathan Ellis
Assignee: Stefania
 Fix For: 3.x


 Currently we drop queries that time out before we get to them (because node 
 is overloaded) but not queries that time out while being processed.  
 (Particularly common for index queries on data that shouldn't be indexed.)  
 Adding the latter and logging when we have to interrupt one gets us a poor 
 man's slow query log for free.





[jira] [Commented] (CASSANDRA-9318) Bound the number of in-flight requests at the coordinator

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9318?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14605345#comment-14605345
 ] 

Benedict commented on CASSANDRA-9318:
-

bq. I think you're \[Benedict\] overselling how scary it is to stop reading new 
requests until we can free up some memory from MS.

The problem is that freeing up memory can be constrained by one or a handful of 
_dead_ nodes. We can not only stop accepting work, but significantly reduce 
cluster throughput as a result of a *single* timed-out node. I'm not 
overselling anything, although I may have a different risk analysis than you do.

Take a simple mathematical thought experiment: we have a four node cluster 
(pretty common), with RF=3, serving 100kop/s per coordinator; these operations 
in memory occupy around 2K as a Mutation (again, pretty typical). Ordinary 
round-trip time is 10ms (also, pretty typical). 

So, under normal load we would see around 2MB of data maintained for our 
queries in-flight across the cluster. But now one node goes down. This node is 
a peer for 3/4 of all writes to the cluster, so we see 150MB/s of data 
accumulate in each coordinator. Our limit is probably no more than 300MB 
(probably lower). Our timeout is 10s. So we now have 8s during which nothing 
can be done, across the cluster, due to one node's death. After that 8s has 
elapsed, we get another flurry. Then another 8s of nothing. Even with a CL of 
ONE.
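The arithmetic behind these figures can be checked with a quick sketch; every input below is one of the hypothetical values from the thought experiment (100k op/s, ~2K per Mutation, 10ms RTT, 10s timeout, ~300MB limit), not a measurement:

```python
# Back-of-the-envelope check of the thought experiment above. All figures are
# the hypothetical values from the comment, not measurements.
op_rate = 100_000          # writes/s per coordinator
mutation_bytes = 2 * 1024  # ~2K in memory per Mutation
rtt_s = 0.010              # ordinary round-trip time: 10ms
timeout_s = 10.0           # request timeout
limit_mb = 300.0           # in-flight memory limit per coordinator
peer_fraction = 3 / 4      # a dead node is a peer for 3/4 of writes (RF=3, 4 nodes)

# Steady state: data is only held for the round-trip time.
steady_mb = op_rate * mutation_bytes * rtt_s / 1e6             # ~2 MB

# One node down: its share accumulates until the timeout fires.
accumulate_mb_per_s = op_rate * mutation_bytes * peer_fraction / 1e6  # ~150 MB/s
fill_s = limit_mb / accumulate_mb_per_s                        # ~2 s to hit the limit
stalled_s = timeout_s - fill_s                                 # ~8 s of nothing
```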

This really is fundamentally opposed to the whole idea of Cassandra, and I 
cannot emphasize how much I am against it except as a literal last resort when 
all other strategies have failed.

bq. Hinting is better than leaving things in an unknown state but it's not 
something we should opt users into if we have a better option, since it 
basically turns the write into CL.ANY.

I was under the impression we had moved to talking about ACK'd writes. I'm not 
suggesting we ack with success to the handler. 

What we do with unack'd writes is actually less important, and we have much 
freer rein there. We could throw OE. We could block, as you suggest, since 
these should be more evenly distributed.

However I would prefer we do both, i.e., when we run out of room in the 
coordinator, we should look to see if there are any nodes that have well in 
excess of their fair share of entries waiting for a response. Let's call this 
set of nodes N.

# if N is empty, we block consumption of new writes, as you propose.
# otherwise, we first evict those that have been ACK'd to the client and can be 
safely hinted (and hint them)
# if this isn't enough, we evict handlers that, if all nodes in N were to fail, 
would break the CL we are waiting on, and we throw OE

step 3 is necessary both for CL.ALL and for the scenario where 2 failing nodes 
have significant overlap
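As a sketch only, the three steps above might look like the following; the handler shape and function name are invented for illustration and do not correspond to Cassandra's actual response-handler API:

```python
# Hypothetical sketch of the three-step eviction policy described above.
# Handler fields and the function name are invented for illustration; this is
# NOT Cassandra's actual ResponseHandler API.
def relieve_pressure(handlers, overloaded_nodes):
    """handlers: in-flight write handlers (dicts); overloaded_nodes: the set N
    of nodes holding well over their fair share of pending responses."""
    if not overloaded_nodes:                      # step 1: N is empty
        return "block"                            # stop consuming new writes
    evicted = []
    for h in list(handlers):                      # step 2: hint ACK'd writes
        if h["acked_to_client"] and h["pending"] <= overloaded_nodes:
            handlers.remove(h)
            evicted.append(("hint", h["id"]))
    for h in list(handlers):                      # step 3: throw OE where CL
        if h["pending"] <= overloaded_nodes:      # could no longer be met if
            handlers.remove(h)                    # all nodes in N failed
            evicted.append(("overloaded_exception", h["id"]))
    return evicted
```

The subset test `h["pending"] <= overloaded_nodes` stands in for "if all of N failed, this handler's CL could no longer be satisfied"; a real version would compare against the consistency level, not raw subset membership.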


 Bound the number of in-flight requests at the coordinator
 -

 Key: CASSANDRA-9318
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9318
 Project: Cassandra
  Issue Type: Improvement
Reporter: Ariel Weisberg
Assignee: Ariel Weisberg
 Fix For: 2.1.x, 2.2.x


 It's possible to somewhat bound the amount of load accepted into the cluster 
 by bounding the number of in-flight requests and request bytes.
 An implementation might do something like track the number of outstanding 
 bytes and requests and if it reaches a high watermark disable read on client 
 connections until it goes back below some low watermark.
 Need to make sure that disabling read on the client connection won't 
 introduce other issues.
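The high/low watermark idea in the description can be sketched minimally as follows; the class and field names are invented, and a real implementation would toggle read interest on the client's channel rather than a flag:

```python
# Minimal sketch of the high/low watermark back-pressure described above.
# Names are invented; a real version would toggle the client channel's read
# interest instead of setting a boolean.
class InFlightLimiter:
    def __init__(self, high_bytes, low_bytes):
        assert low_bytes < high_bytes
        self.high, self.low = high_bytes, low_bytes
        self.in_flight = 0            # outstanding request bytes
        self.reads_enabled = True

    def on_request(self, size):
        self.in_flight += size
        if self.in_flight >= self.high:
            self.reads_enabled = False   # stop reading client connections

    def on_response(self, size):
        self.in_flight -= size
        if self.in_flight <= self.low:
            self.reads_enabled = True    # resume once below the low watermark
```

Separating the two watermarks avoids flapping: reads stay disabled through the whole drain from high down to low.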





[jira] [Commented] (CASSANDRA-9064) [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe table statement

2015-06-29 Thread Benjamin Lerer (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9064?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606151#comment-14606151
 ] 

Benjamin Lerer commented on CASSANDRA-9064:
---

The unit test results are 
[here|http://cassci.datastax.com/view/Dev/view/blerer/job/blerer-9064-testall/lastCompletedBuild/testReport/]
 and the dtest results are 
[here|http://cassci.datastax.com/view/Dev/view/blerer/job/blerer-9064-dtest/lastCompletedBuild/testReport/].

 [LeveledCompactionStrategy] cqlsh can't run cql produced by its own describe 
 table statement
 

 Key: CASSANDRA-9064
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9064
 Project: Cassandra
  Issue Type: Bug
  Components: Core
 Environment: cassandra 2.1.3 on mac os x
Reporter: Sujeet Gholap
Assignee: Benjamin Lerer
  Labels: cqlsh
 Fix For: 2.2.0 rc2, 2.1.8


 Here's how to reproduce:
 1) Create a table with LeveledCompactionStrategy
 CREATE keyspace foo WITH REPLICATION = {'class': 'SimpleStrategy', 
 'replication_factor' : 3};
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH compaction = {'class': 'LeveledCompactionStrategy'};
 2) Describe the table and save the output
 cqlsh -e "describe table foo.bar"
 Output should be something like
 CREATE TABLE foo.bar (
 spam text PRIMARY KEY
 ) WITH bloom_filter_fp_chance = 0.1
 AND caching = '{"keys":"ALL", "rows_per_partition":"NONE"}'
 AND comment = ''
 AND compaction = {'min_threshold': '4', 'class': 
 'org.apache.cassandra.db.compaction.LeveledCompactionStrategy', 
 'max_threshold': '32'}
 AND compression = {'sstable_compression': 
 'org.apache.cassandra.io.compress.LZ4Compressor'}
 AND dclocal_read_repair_chance = 0.1
 AND default_time_to_live = 0
 AND gc_grace_seconds = 864000
 AND max_index_interval = 2048
 AND memtable_flush_period_in_ms = 0
 AND min_index_interval = 128
 AND read_repair_chance = 0.0
 AND speculative_retry = '99.0PERCENTILE';
 3) Save the output to repro.cql
 4) Drop the table foo.bar
 cqlsh -e "drop table foo.bar"
 5) Run the create table statement we saved
 cqlsh -f repro.cql
 6) Expected: normal execution without an error
 7) Reality:
 ConfigurationException: ErrorMessage code=2300 [Query invalid because of 
 configuration issue] message=Properties specified [min_threshold, 
 max_threshold] are not understood by LeveledCompactionStrategy





[jira] [Issue Comment Deleted] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread mlowicki (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

mlowicki updated CASSANDRA-9681:

Comment: was deleted

(was: I'll get heap dump probably tomorrow then as nodes have been restarted ~2 
hours ago.)

 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Assignee: Benedict
Priority: Critical
 Fix For: 2.1.x

 Attachments: cassandra.yaml, system.log.6.zip, system.log.7.zip, 
 system.log.8.zip, system.log.9.zip


 C* 2.1.7 cluster is behaving really bad after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})
 Before C* 2.1.5 memtables heap size was basically constant ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes is behaves stable for 1-2days. Today I've done 
 that and long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC  pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.
 Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
 level - 
 https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.
 Replication factor is set to 3.





[jira] [Commented] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread Benedict (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606170#comment-14606170
 ] 

Benedict commented on CASSANDRA-9681:
-

So, for posterity, I ran the following bash script for analysing the logs:

{code}
grep -E "Completed flushing|Enqueuing flush of ([^:]+): [0-9]+ \(([0-9]+)%\)" 
system.log.2* | grep -v compactions_in_progress | sed -r "s@.* - (.*)@\1@" | 
sed -r "s@Completed flushing .*-([^-]+)-ka-[0-9]+-Data.db.*@completed \1@" | 
sed -r 's@Enqueuing flush of ([^ :]+): [0-9]+ \(([0-9]+)%.*@started \1 \2@' | 
awk '{ if ($1 == "started") { total[$2] += $3; list[$2][end[$2]] = $3; 
end[$2]++; } else { total[$2] -= list[$2][start[$2]]; delete 
list[$2][start[$2]]; start[$2]++; } print("total: " total[$2] " " $0); }' | 
sort | less
{code}

This indicates flushing is happening as expected, and staying well within the 
bounds that are supposed to be enforced. These same numbers feed into those 
that are reported via JMX. In fact, they should be strictly greater than that 
returned by JMX, since JMX only reports the live memtables. So the numbers that 
suggest you're exceeding your memtable space limits are hard to explain.

The heap dump will no doubt help a great deal.

 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Assignee: Benedict
Priority: Critical
 Fix For: 2.1.x

 Attachments: cassandra.yaml, system.log.6.zip, system.log.7.zip, 
 system.log.8.zip, system.log.9.zip


 C* 2.1.7 cluster is behaving really bad after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}})
 Before C* 2.1.5 memtables heap size was basically constant ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes is behaves stable for 1-2days. Today I've done 
 that and long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC  pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.
 Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
 level - 
 https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.
 Replication factor is set to 3.





[jira] [Assigned] (CASSANDRA-9680) Update CQL version

2015-06-29 Thread Benjamin Lerer (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9680?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Benjamin Lerer reassigned CASSANDRA-9680:
-

Assignee: Benjamin Lerer  (was: Tyler Hobbs)

 Update CQL version
 --

 Key: CASSANDRA-9680
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9680
 Project: Cassandra
  Issue Type: Bug
Reporter: Sylvain Lebresne
Assignee: Benjamin Lerer
 Fix For: 2.2.0 rc2


 As far as I can tell, we haven't upgraded the CQL spec version for 2.2. I say 
 we should call it CQL 3.3.
 [~thobbs] Can you look at upgrading the version in the code and the doc? 
 Listing the actual changes in the doc changelog would be awesome too.





[jira] [Commented] (CASSANDRA-9640) Nodetool repair of very wide, large rows causes GC pressure and destabilization

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606183#comment-14606183
 ] 

Jonathan Ellis commented on CASSANDRA-9640:
---

I don't actually see GC pressure in the attached log zip.  GCInspector says G1 
old gen is stable at about 4GB which shouldn't be a problem unless you have a 
tiny heap.


 Nodetool repair of very wide, large rows causes GC pressure and 
 destabilization
 ---

 Key: CASSANDRA-9640
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9640
 Project: Cassandra
  Issue Type: Bug
 Environment: AWS, ~8GB heap
Reporter: Constance Eustace
Assignee: Yuki Morishita
Priority: Minor
 Fix For: 2.1.x

 Attachments: syslog.zip


 We've noticed our nodes becoming unstable with large, unrecoverable Old Gen 
 GCs until OOM.
 This appears to be around the time of repair, and the specific cause seems to 
 be one of our report computation tables that involves possibly very wide rows 
 with 10GB of data in it. This is an RF=3 table in a four-node cluster.
 We truncate this occasionally, and we also had disabled this computation 
 report for a bit and noticed better node stability.
 I wish I had more specifics. We are switching to an RF=1 table and will do more 
 proactive truncation of the table. 
 When things calm down, we will attempt to replicate the issue and watch GC 
 and other logs.
 Any suggestion for things to look for/enable tracing on would be welcome.





[jira] [Commented] (CASSANDRA-9640) Nodetool repair of very wide, large rows causes GC pressure and destabilization

2015-06-29 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14606187#comment-14606187
 ] 

Jonathan Ellis commented on CASSANDRA-9640:
---

I do see lots of this in the log:

bq. WARN  [SharedPool-Worker-3] 2015-06-22 16:30:57,310 BatchStatement.java:243 
- Batch of prepared statements for [publish_core.entity_asset, 
publish_core.relationbackref, publish_core.entity_product, 
publish_core.relation] is of size 5716, exceeding specified threshold of 5120 
by 596.

If you're using batches for bulk loading, don't.

If you're not using batches for bulk loading, you may want to rethink your data 
model.

 Nodetool repair of very wide, large rows causes GC pressure and 
 destabilization
 ---

 Key: CASSANDRA-9640
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9640
 Project: Cassandra
  Issue Type: Bug
 Environment: AWS, ~8GB heap
Reporter: Constance Eustace
Assignee: Yuki Morishita
Priority: Minor
 Fix For: 2.1.x

 Attachments: syslog.zip


 We've noticed our nodes becoming unstable with large, unrecoverable Old Gen 
 GCs until OOM.
 This appears to be around the time of repair, and the specific cause seems to 
 be one of our report computation tables that involves possibly very wide rows 
 with 10GB of data in them. This is an RF 3 table in a four-node cluster.
 We truncate this occasionally, and we also had disabled this computation 
 report for a bit and noticed better node stability.
 I wish I had more specifics. We are switching to an RF 1 table and will do 
 more proactive truncation of the table. 
 When things calm down, we will attempt to replicate the issue and watch GC 
 and other logs.
 Any suggestion for things to look for/enable tracing on would be welcome.





[jira] [Resolved] (CASSANDRA-9640) Nodetool repair of very wide, large rows causes GC pressure and destabilization

2015-06-29 Thread Jonathan Ellis (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jonathan Ellis resolved CASSANDRA-9640.
---
Resolution: Not A Problem
  Assignee: (was: Yuki Morishita)

 Nodetool repair of very wide, large rows causes GC pressure and 
 destabilization
 ---

 Key: CASSANDRA-9640
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9640
 Project: Cassandra
  Issue Type: Bug
 Environment: AWS, ~8GB heap
Reporter: Constance Eustace
Priority: Minor
 Fix For: 2.1.x

 Attachments: syslog.zip


 We've noticed our nodes becoming unstable with large, unrecoverable Old Gen 
 GCs until OOM.
 This appears to be around the time of repair, and the specific cause seems to 
 be one of our report computation tables that involves possibly very wide rows 
 with 10GB of data in them. This is an RF 3 table in a four-node cluster.
 We truncate this occasionally, and we also had disabled this computation 
 report for a bit and noticed better node stability.
 I wish I had more specifics. We are switching to an RF 1 table and will do 
 more proactive truncation of the table. 
 When things calm down, we will attempt to replicate the issue and watch GC 
 and other logs.
 Any suggestion for things to look for/enable tracing on would be welcome.





[jira] [Updated] (CASSANDRA-9681) Memtable heap size grows and many long GC pauses are triggered

2015-06-29 Thread Philip Thompson (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Philip Thompson updated CASSANDRA-9681:
---
Assignee: Benedict

 Memtable heap size grows and many long GC pauses are triggered
 --

 Key: CASSANDRA-9681
 URL: https://issues.apache.org/jira/browse/CASSANDRA-9681
 Project: Cassandra
  Issue Type: Bug
 Environment: C* 2.1.7, Debian Wheezy
Reporter: mlowicki
Assignee: Benedict
Priority: Critical
 Fix For: 2.1.x


 C* 2.1.7 cluster is behaving really badly after 1-2 days. 
 {{gauges.cassandra.jmx.org.apache.cassandra.metrics.ColumnFamily.AllMemtablesHeapSize.Value}}
  jumps to 7 GB 
 (https://www.dropbox.com/s/vraggy292erkzd2/Screenshot%202015-06-29%2019.12.53.png?dl=0)
  on 3/6 nodes in each data center and then there are many long GC pauses. 
 Cluster is using default heap size values ({{-Xms8192M -Xmx8192M -Xmn2048M}}).
 Before C* 2.1.5, memtable heap size was basically constant at ~500MB 
 (https://www.dropbox.com/s/fjdywik5lojstvn/Screenshot%202015-06-29%2019.30.00.png?dl=0)
 After restarting all nodes it behaves stably for 1-2 days. Today I've done 
 that and the long GC pauses are gone (~18:00 
 https://www.dropbox.com/s/7vo3ynz505rsfq3/Screenshot%202015-06-29%2019.28.37.png?dl=0).
  The only pattern we've found so far is that long GC pauses are happening 
 basically at the same time on all nodes in the same data center - even on the 
 ones where memtables heap size is not growing.
 Cliffs on the graphs are nodes restarts.
 Used memory on boxes where {{AllMemtablesHeapSize}} grows stays at the same 
 level - 
 https://www.dropbox.com/s/tes9abykixs86rf/Screenshot%202015-06-29%2019.37.52.png?dl=0.
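Since the {{AllMemtablesHeapSize}} gauge is already being exported, a crude client-side check can flag the runaway-growth pattern described here before the long GC pauses start. A hypothetical sketch, assuming samples are byte values polled at a fixed interval; the 4 GB limit is an arbitrary alarm point relative to the 8 GB heap, not a Cassandra default:

```python
def memtable_heap_alert(samples, limit_bytes=4 * 1024 ** 3):
    """Return True when the gauge has been non-decreasing across the window
    AND the latest sample exceeds limit_bytes, i.e. flushes are apparently
    not reclaiming memtable heap (the pattern reported in this ticket)."""
    if len(samples) < 2:
        # Not enough history to call it a trend.
        return False
    growing = all(b >= a for a, b in zip(samples, samples[1:]))
    return growing and samples[-1] > limit_bytes
```

A dip in the series (a successful flush reclaiming heap) or a latest value under the limit keeps the alert quiet; only sustained growth past the limit fires it.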




