How to clear all data using CQL?

2014-04-16 Thread Sebastian Schmidt
Hey I want to do unit tests for software that uses Cassandra as storage backend. Using the Hector client, I could just iterate over all keyspaces and delete them. In CQL, there is no query to list all keyspaces. How can I drop all Keyspaces from Cassandra, without knowing every single Keyspace

Re: How to clear all data using CQL?

2014-04-16 Thread Mark Reddy
To get a list of all keyspaces via CQL you can run: *describe keyspaces *or *select keyspace_name from system.schema_keyspaces;* On Wed, Apr 16, 2014 at 9:23 AM, Sebastian Schmidt isib...@gmail.comwrote: Hey I want to do unit tests for software that uses Cassandra as storage backend. Using

Re: Replication Factor question

2014-04-16 Thread Markus Jais
Hi Rob, thanks. How many nodes to you have running in those 5 racks and RF 5? Only 5 nodes or more? Markus Robert Coli rc...@eventbrite.com schrieb am 20:36 Dienstag, 15.April 2014: On Tue, Apr 15, 2014 at 6:14 AM, Ken Hancock ken.hanc...@schange.com wrote: Keep in mind if you lose the wrong

Re: How to clear all data using CQL?

2014-04-16 Thread Sebastian Schmidt
Thank you that worked! Am 16.04.2014 10:46, schrieb Mark Reddy: /select keyspace_name from system.schema_keyspaces;/

Deleting column names

2014-04-16 Thread Sebastian Schmidt
Hi, I'm using a Cassandra table to store some data. I created the table like this: CREATE TABLE IF NOT EXISTS table_name (s BLOB, p BLOB, o BLOB, c BLOB, PRIMARY KEY (s, p, o, c)); I need the at least the p column to be sorted, so that I can use it in a WHERE clause. So as far as I understand,

How safe is nodetool move in 1.2 ?

2014-04-16 Thread Oleg Dulin
I need to rebalance my cluster. I am sure this question has been asked before -- will 1.2 continue to serve reads and writes correctly while move is in progress ? Need this for my sanity. -- Regards, Oleg Dulin http://www.olegdulin.com

Re: How safe is nodetool move in 1.2 ?

2014-04-16 Thread Jonathan Lacefield
Assuming you have enough nodes not undergoing move to meet your CL requirements, then yes, your cluster will still accept reads and writes. However, it's always good to test this before doing it in production to ensure your cluster and app will function as designed. Jonathan Lacefield Solutions

CqlPagingInputFormat: paging through wide rows

2014-04-16 Thread Paolo Estrella
Hello, I've just upgraded to Cassandra 1.2.16. I've also started using the CqlPagingInputFormat within my map/reduce tasks. I have a question with regard to using CqlPagingInputFormat for paging through wide rows. I don't see a way to input more than one column at a time into my Mapper. I

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Hi, Mine is a simple case. Running on single node only. Keyspace is: create keyspace twitter with replication = {'class':'SimpleStrategy', 'replication_factor' : 3} -Vivek On Wed, Apr 16, 2014 at 1:27 AM, Tupshin Harper tups...@tupshin.com wrote: Please provide your keyspace definition, and

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
create keyspace twitter with replication = {'class':'SimpleStrategy', 'replication_factor' : 3} Your replication factor is your issue here, you have a single node and a RF=3. For a single node setup your RF should be 1. You can find more info about replication here:

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
That is entirely the problem. By having one node, but RF=3, you have created this exact problem. Each node can own, at most, one copy of each row of data, so the cluster is treating this (correctly) as a degraded runtime environment where no operation requiring CL 1 will always fail. LWT

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks Mark. does this mean with RF=3, all 3 nodes must be up and running for CAS updates? -Vivek On Wed, Apr 16, 2014 at 6:22 PM, Mark Reddy mark.re...@boxever.com wrote: create keyspace twitter with replication = {'class':'SimpleStrategy', 'replication_factor' : 3} Your replication

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
No, but you do need a quorum of nodes. http://www.datastax.com/documentation/cassandra/2.0/cassandra/dml/dml_config_consistency_c.html SERIAL A write must be written conditionally to the commit log and memory table on a quorum of replica nodes. Used to achievelinearizable

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Mark Reddy
The Paxos protocol used for CAS operations will always use at least a consistency level effectively equivalent to QUORUM (called SERIAL) when writing, even if you explicitly specify a lower level, e.g. ANY or ONE. Setting consistency level to ALL will make the write execute on all replicas if the

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks Mark and Tuphsin. So on single node, if i set consistency level to SERIAL and create a keyspace with RF=1? Would that work? -Vivek On Wed, Apr 16, 2014 at 6:32 PM, Mark Reddy mark.re...@boxever.com wrote: The Paxos protocol used for CAS operations will always use at least a

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Tupshin Harper
It will work for correctness, but give you a very inaccurate view of performance,. -Tupshin On Apr 16, 2014 6:05 AM, Vivek Mishra mishra.v...@gmail.com wrote: Thanks Mark and Tuphsin. So on single node, if i set consistency level to SERIAL and create a keyspace with RF=1? Would that work?

Re: Unable to complete request: one or more nodes were unavailable.

2014-04-16 Thread Vivek Mishra
Thanks, i think got the point.CAS doesn't make much sense on single node. -Vivek On Wed, Apr 16, 2014 at 6:37 PM, Tupshin Harper tups...@tupshin.com wrote: It will work for correctness, but give you a very inaccurate view of performance,. -Tupshin On Apr 16, 2014 6:05 AM, Vivek Mishra

GC histogram analysis

2014-04-16 Thread Ruchir Jha
Hi, I am trying to investigate ParNew promotion failures happening routinely in production. As part of this exercise, I enabled -XX:PrintHistogramBeforeFullGC and saw the following output. As you can see there are a ton of Columns, ExpiringColumns and DeletedColumns before GC ran and these

Re: GC histogram analysis

2014-04-16 Thread Mark Reddy
Do you delete and/or set TTLs on your data? On Wed, Apr 16, 2014 at 2:14 PM, Ruchir Jha ruchir@gmail.com wrote: Hi, I am trying to investigate ParNew promotion failures happening routinely in production. As part of this exercise, I enabled -XX:PrintHistogramBeforeFullGC and saw the

Re: GC histogram analysis

2014-04-16 Thread Ruchir Jha
No we don't. Sent from my iPhone On Apr 16, 2014, at 9:21 AM, Mark Reddy mark.re...@boxever.com wrote: Do you delete and/or set TTLs on your data? On Wed, Apr 16, 2014 at 2:14 PM, Ruchir Jha ruchir@gmail.com wrote: Hi, I am trying to investigate ParNew promotion failures

Re: How to clear all data using CQL?

2014-04-16 Thread James Horey
If you’re running unit tests and repeatadly clearing the Cassandra keyspaces, you may want to check out Ferry (ferry.opencore.io). It lets you standup/destroy multiple Cassandra stacks locally on your machine and is useful for the use case you described. I’m the author of Ferry, and would be

Re: GC histogram analysis

2014-04-16 Thread Chris Lohfink
You can take a heap dump and find out who has references to it. Can find out more which column family they are from. Do you have a lot of tombstones or have data thats over written a lot or and doing a ton of reads? Maybe wide rows that your querying across or using filtering? Reads could

Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
I'm running a cluster with Cassandra and my app embedded. Regarding performance, it is better to run embedded Cassandra? What are the implications of running an embedded Cassandra ? Tks -- Atenciosamente, Sávio S. Teles de Oliveira voice: +55 62 9136 6996 http://br.linkedin.com/in/savioteles

Re: Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
Is it advisable to run the embedded Cassandra in production? 2014-04-16 12:08 GMT-03:00 Sávio Teles savio.te...@lupa.inf.ufg.br: I'm running a cluster with Cassandra and my app embedded. Regarding performance, it is better to run embedded Cassandra? What are the implications of running an

Re: Problems with adding datacenter and schema version disagreement

2014-04-16 Thread Russell Hatch
I think you might be seeing the issue reported in https://issues.apache.org/jira/browse/CASSANDRA-6971 If that's the case, it looks like a fix will be in 2.0.7 Thanks, Russ On Tue, Apr 15, 2014 at 11:48 PM, Umut Kocasaraç ukocasa...@gmail.comwrote: Hi Olek, Could you solve the problem.

Re: Embedded Cassandra Performance

2014-04-16 Thread Chris Lohfink
There will be a small performance improvement from not having the app-cluster latency. If ran on same system (which I wouldn’t recommend) the latency would be pretty irrelevant anyway unless you are fighting for sub millisecond latency (in which case get off the JVM). You would be able to

Bootstrap Timing

2014-04-16 Thread Phil Burress
Greetings, How long does bootstrapping typically take? I have 3 existing nodes in our cluster with about 40GB each. I've added three new nodes to the cluster. They have been in bootstrap mode for a little over 3 days now. Should I be concerned? Is there a way to tell how long it will take to

Re: Bootstrap Timing

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 11:10 AM, Phil Burress philburress...@gmail.comwrote: How long does bootstrapping typically take? I have 3 existing nodes in our cluster with about 40GB each. I've added three new nodes to the cluster. They have been in bootstrap mode for a little over 3 days now.

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Thanks very much for the response. I'm not using vnodes, does that matter? On Wed, Apr 16, 2014 at 2:13 PM, Robert Coli rc...@eventbrite.com wrote: On Wed, Apr 16, 2014 at 11:10 AM, Phil Burress philburress...@gmail.comwrote: How long does bootstrapping typically take? I have 3 existing

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Also, one more quick question. For the new nodes, do I add all three existing nodes as seeds? Or just add one? On Wed, Apr 16, 2014 at 2:16 PM, Phil Burress philburress...@gmail.comwrote: Thanks very much for the response. I'm not using vnodes, does that matter? On Wed, Apr 16, 2014 at 2:13

Re: Bootstrap Timing

2014-04-16 Thread Ken Hancock
Seed nodes don't bootstrap. https://issues.apache.org/jira/browse/CASSANDRA-5836 On Wed, Apr 16, 2014 at 2:17 PM, Phil Burress philburress...@gmail.comwrote: Also, one more quick question. For the new nodes, do I add all three existing nodes as seeds? Or just add one? On Wed, Apr 16,

Re: Embedded Cassandra Performance

2014-04-16 Thread Sávio Teles
Thanks Chris! 2014-04-16 12:53 GMT-03:00 Chris Lohfink clohf...@blackbirdit.com: There will be a small performance improvement from not having the app-cluster latency. If ran on same system (which I wouldn’t recommend) the latency would be pretty irrelevant anyway unless you are fighting

Re: Bootstrap Timing

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 11:16 AM, Phil Burress philburress...@gmail.comwrote: Thanks very much for the response. I'm not using vnodes, does that matter? Not in your case. In some cases it is safe to bootstrap multiple nodes into a cluster at once AT SPECIFIC TOKENS, because there is more than

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
Thanks! On Wed, Apr 16, 2014 at 2:50 PM, Robert Coli rc...@eventbrite.com wrote: On Wed, Apr 16, 2014 at 11:16 AM, Phil Burress philburress...@gmail.comwrote: Thanks very much for the response. I'm not using vnodes, does that matter? Not in your case. In some cases it is safe to

Re: Bootstrap Timing

2014-04-16 Thread Phil Burress
I've shut down two of the nodes and am bootstrapping one right now. Is there any way to tell when it will finish bootstrapping? On Wed, Apr 16, 2014 at 2:56 PM, Phil Burress philburress...@gmail.comwrote: Thanks! On Wed, Apr 16, 2014 at 2:50 PM, Robert Coli rc...@eventbrite.com wrote: On

Stalled streams during repairs

2014-04-16 Thread Andrew Cooper
We are running into a reproducible issue in one of our cassandra clusters. We are seeing that during an anti-entropy repair, if a particular sstable is streaming to multiple endpoints and the two streams happen to hit the same section of the sstable, it stalls all streams indefinitely on the

Re: How safe is nodetool move in 1.2 ?

2014-04-16 Thread Richard Low
On 16 April 2014 05:08, Jonathan Lacefield jlacefi...@datastax.com wrote: Assuming you have enough nodes not undergoing move to meet your CL requirements, then yes, your cluster will still accept reads and writes. However, it's always good to test this before doing it in production to ensure

Re: Stalled streams during repairs

2014-04-16 Thread Robert Coli
On Wed, Apr 16, 2014 at 3:17 PM, Andrew Cooper andrew.coo...@nisc.coopwrote: We are running cassandra 1.2.5. I have checked through the change logs up to 1.2.16 and do not see any indications of this being a known (and fixed) issue. Repair has been re-written in 2.0, because it was broken;

Re: How safe is nodetool move in 1.2 ?

2014-04-16 Thread Colin
I have recently tested this scenario under a couple versions of Cassandra and have been able to write and read to/from the cluster while performing a move. I performed these tests utilizing an RF=2 on a three node cluster while performing quorum reads and received no errors due to unavailable

RE: Stalled streams during repairs

2014-04-16 Thread Andrew Cooper
First, thanks for the quick reply and jira links! Its helpful to know we are not the only ones experiencing these issues. Are you sure you actually want/need to run repair as frequently as you currently are? Reducing the frequency won't make it work any better, but it will reduce the number

Re: Apache cassandra not joining cluster ring

2014-04-16 Thread Joyabrata Das
After explicitly enabling port 9160 (Cassandra client port (Thrift)) in that particular node it's connected to ring however other nodes are working with same configuration. Now Opscenter is installed in that particular node after explicitly enabling 61620 (OpsCenter monitoring port), 61621