Re: vnodes ready for production ?

2013-06-18 Thread Alain RODRIGUEZ
Any insights on vnodes, one month after my original post ? 2013/5/16 Alain RODRIGUEZ arodr...@gmail.com Hi, Adding vnodes is a big improvement to Cassandra, specifically because we have a fluctuating load on our Cassandra depending on the week, and it is quite annoying to add some nodes

Re: [Cassandra] Expanding a Cassandra cluster

2013-06-18 Thread Richard Low
On 10 June 2013 22:00, Emalayan Vairavanathan svemala...@yahoo.com wrote: b) Will Cassandra automatically take care of removing obsolete keys in future ? In a future version Cassandra should automatically clean up for you:

What is the effect of reducing the thrift message sizes on GC

2013-06-18 Thread Ananth Gundabattula
We are currently running on 1.1.10 and planning to migrate to a higher version 1.2.4. The question pertains to tweaking all the knobs to reduce GC related issues ( we have been fighting a lot of really bad GC issues on 1.1.10 and met with little success all the way using 1.1.10) Taking into

rename a cluster in cassandra 1.2.6

2013-06-18 Thread Paco a.k.a. Francisco Trujillo
I am using Cassandra 1.2.6 in a cluster with a single node. I am trying to rename the cluster using the instructions in: Cassandra clustername mismatch. After doing all the steps indicated, I still get the same error when I start Cassandra after changing the cassandra.yaml file. Does anyone know if

Re: rename a cluster in cassandra 1.2.6

2013-06-18 Thread aaron morton
The cluster name is read from the yaml file the first time the server starts and stored in the system tables; these are in the local CF in the system KS. If this is a test system, just blow away the data for the CF or truncate it. Cheers - Aaron Morton Freelance Cassandra

Re: What is the effect of reducing the thrift message sizes on GC

2013-06-18 Thread aaron morton
*thrift_framed_transport_size_in_mb thrift_max_message_length_in_mb* These control the max size of a buffer allocated by thrift when processing requests / responses. The buffers are not pre-allocated, but once they are allocated they are not returned. So it's only an issue if you have lots of

Re: vnodes ready for production ?

2013-06-18 Thread aaron morton
Even more if we could automate some up-scale thanks to AWS alarms, It would be awesome. I saw a demo for Priam (https://github.com/Netflix/Priam) doing that at Netflix in March, not sure if it's public yet. Are the vnodes feature and the tokens => vnodes transition safe enough to go live

Re: Reduce Cassandra GC

2013-06-18 Thread Joel Samuelsson
Can't find any promotion failure. In system.log this is what I get: INFO [ScheduledTasks:1] 2013-06-17 08:13:47,490 GCInspector.java (line 122) GC for ParNew: 145189 ms for 1 collections, 225905072 used; max is 4114612224 INFO [ScheduledTasks:1] 2013-06-17 08:13:47,490 StatusLogger.java (line
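A GCInspector line like the one quoted above can be picked apart with a short script. This is only a sketch: the regular expression is fitted to the single line shown in this post, and the function and field names (`parse_gc_inspector`, `pause_ms`, and so on) are made up for illustration, not taken from any Cassandra tooling.

```python
import re

# Pattern fitted to the GCInspector line quoted in the post (an assumption:
# other Cassandra versions may format this line differently).
GC_LINE = re.compile(
    r"GC for (?P<collector>\w+): (?P<millis>\d+) ms for (?P<count>\d+) collections, "
    r"(?P<used>\d+) used; max is (?P<max>\d+)"
)


def parse_gc_inspector(line):
    """Return the GC fields from a GCInspector log line, or None if it does not match."""
    match = GC_LINE.search(line)
    if match is None:
        return None
    used = int(match.group("used"))
    heap_max = int(match.group("max"))
    return {
        "collector": match.group("collector"),
        "pause_ms": int(match.group("millis")),
        "collections": int(match.group("count")),
        "used_bytes": used,
        "max_bytes": heap_max,
        "heap_fraction": used / heap_max,
    }


sample = (
    "INFO [ScheduledTasks:1] 2013-06-17 08:13:47,490 GCInspector.java (line 122) "
    "GC for ParNew: 145189 ms for 1 collections, 225905072 used; max is 4114612224"
)
info = parse_gc_inspector(sample)
print(info["collector"], info["pause_ms"], round(info["heap_fraction"], 3))
```

For the quoted line this reports a single ParNew collection of about 145 seconds with only a small fraction of the heap in use afterwards, which is why the thread goes on to look at the JVM GC log and the process settings.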

Re: Node failing to decomission (vnodes and 1.2.5)

2013-06-18 Thread aaron morton
I also am not seeing anything in the nodes log files to suggest errors during streaming or leaving. You should see a log message saying DECOMMISSIONED when the process completes. What does nodetool status say? What suggestions does anyone have on getting this node removed from my ring so

Re: What is the effect of reducing the thrift message sizes on GC

2013-06-18 Thread Ananth Gundabattula
Thanks Aaron for the insight. One quick question: The buffers are not pre-allocated, but once they are allocated they are not returned. So it's only an issue if you have lots of clients connecting and reading a lot of data. So to understand you correctly, the buffer is allocated per client

SQL Injection C* (via CQL Thrift)

2013-06-18 Thread Brian O'Neill
Mostly for fun, I wanted to throw this out there... We are undergoing a security audit for our platform (C* + Elastic Search + Storm). One component of that audit is susceptibility to SQL injection. I was wondering if anyone has attempted to construct a SQL injection attack against Cassandra?

Data not fully replicated with 2 nodes and replication factor 2

2013-06-18 Thread James Lee
Hello, I'm seeing a strange problem with a 2-node Cassandra test deployment, where it seems that data isn't being replicated among the nodes as I would expect. I suspect this may be a configuration issue of some kind, but have been unable to figure what I should change. The setup is as

Re: Reduce Cassandra GC

2013-06-18 Thread Takenori Sato
GC logging is not in system.log, but in the following file: JVM_OPTS="$JVM_OPTS -Xloggc:/var/log/cassandra/gc-`date +%s`.log" At least, no GC logs are shown in your post. On Tue, Jun 18, 2013 at 5:05 PM, Joel Samuelsson samuelsson.j...@gmail.com wrote: Can't find any promotion failure. In

Re: SQL Injection C* (via CQL Thrift)

2013-06-18 Thread Sylvain Lebresne
If you're not careful, then CQL injection is possible. Say you naively build your query with "UPDATE foo SET col='" + user_input + "' WHERE key = 'k'"; then if user_input is "foo' AND col2='bar", your user will have overwritten a column it shouldn't have been able to. And something equivalent in a
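The concatenation problem described in this message can be reproduced with plain string handling. The sketch below is illustrative only: it builds the statement text and never talks to a cluster, the helper names (`naive_update`, `escape_cql_string`, `escaped_update`) are hypothetical, and it does not check whether any particular Cassandra version would accept the resulting statement. It relies on one CQL rule, namely that a single quote inside a string literal is escaped by doubling it.

```python
def naive_update(user_input: str) -> str:
    # Vulnerable: the user's text is pasted straight into the statement,
    # so a quote in the input ends the string literal early.
    return "UPDATE foo SET col='" + user_input + "' WHERE key = 'k'"


def escape_cql_string(value: str) -> str:
    # CQL string literals escape a single quote by doubling it.
    return value.replace("'", "''")


def escaped_update(user_input: str) -> str:
    # Same statement, but the input can no longer close the literal.
    return "UPDATE foo SET col='" + escape_cql_string(user_input) + "' WHERE key = 'k'"


malicious = "foo' AND col2='bar"
print(naive_update(malicious))
print(escaped_update(malicious))
```

In the first printed statement the input has broken out of the literal and introduced a second assignment; in the second it stays inside one quoted value. Where the client library supports prepared statements with bound values, that is usually the safer route than escaping by hand.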

Re: Large number of files for Leveled Compaction

2013-06-18 Thread Franc Carter
On Mon, Jun 17, 2013 at 3:37 PM, Franc Carter franc.car...@sirca.org.au wrote: On Mon, Jun 17, 2013 at 3:28 PM, Wei Zhu wz1...@yahoo.com wrote: default value of 5MB is way too small in practice. Too many files in one directory is not a good thing. It's not clear what should be a good number.

RE: What is the effect of reducing the thrift message sizes on GC

2013-06-18 Thread Viktor Jevdokimov
Our experience shows that write load (memtables) impacts ParNew GC most. More writes, more frequent ParNew GC. Time of ParNew GC depends on how many writes were made during the cycle between ParNew GCs and the size of NEW_HEAP (young gen). Basically ParNew GC itself takes longer when more objects have to

Re: SQL Injection C* (via CQL Thrift)

2013-06-18 Thread Brian O'Neill
Perfect. Thanks Sylvain. That is exactly the input I was looking for, and I agree completely. (It's easy enough to protect against.) As for the thrift side (i.e. using Hector or Astyanax), anyone have a crafty way to inject something? At first glance, it doesn't appear possible, but I'm not

Re: Node failing to decomission (vnodes and 1.2.5)

2013-06-18 Thread David McNelis
Never saw decommissioned in the logs, and status continues to say UL. Removenode sounds like it's likely to get the job done for us at this point. Thanks. David On Tue, Jun 18, 2013 at 3:10 AM, aaron morton aa...@thelastpickle.com wrote: I also am not seeing anything in the nodes log

Heap is not released and streaming hangs at 0%

2013-06-18 Thread srmore
I see an issue when I run high traffic to the Cassandra nodes: the heap gets full to about 94% (which is expected), but the thing that confuses me is that the heap usage never goes down after the traffic is stopped (at least, it appears to be so). I kept the nodes up for a day after stopping the

Re: Reduce Cassandra GC

2013-06-18 Thread Joel Samuelsson
Yes, like I said, the only relevant output from that file was: 2013-06-17T08:11:22.300+: 2551.288: [GC 870971K->216494K(4018176K), 145.1887460 secs] 2013/6/18 Takenori Sato ts...@cloudian.com GC logging is not in system.log. But in the following file. JVM_OPTS=$JVM_OPTS

Re: Reduce Cassandra GC

2013-06-18 Thread Mohit Anchlia
Is your young generation size set to 4GB? Can you paste the output of ps -ef|grep cassandra ? On Tue, Jun 18, 2013 at 8:48 AM, Joel Samuelsson samuelsson.j...@gmail.com wrote: Yes, like I said, the only relevant output from that file was: 2013-06-17T08:11:22.300+: 2551.288: [GC

ANN Introducing Cassaforte, a Clojure client for Cassandra built around CQL 3.0

2013-06-18 Thread Michael Klishin
Cassaforte [1] is a Clojure client for Cassandra built around CQL 3.0 and focusing on ease of use. It's built on top of the new DataStax Java driver [2] and supports all the major features you'd expect from a data store client: * Connection to a single node or a cluster * All CQL 3.0 operations

Dropped mutation messages

2013-06-18 Thread cem
Hi All, I have a cluster of 5 nodes with C* 1.2.4. Each node has 4 disks 1 TB each. I see a lot of dropped messages after it stores 400 GB per disk. (1.6 TB per node). The recommendation was 500 GB max per node before 1.2. Datastax says that we can store terabytes of data per node with 1.2.

Re: Heap is not released and streaming hangs at 0%

2013-06-18 Thread Robert Coli
On Tue, Jun 18, 2013 at 8:25 AM, srmore comom...@gmail.com wrote: I see an issue when I run high traffic to the Cassandra nodes: the heap gets full to about 94% (which is expected) Which is expected to cause GC failure? ;) But seriously, the reason your node is unable to GC is that you have

Re: rename a cluster in cassandra 1.2.6

2013-06-18 Thread Faraaz Sareshwala
Can you expand on the reasoning behind this? I was bitten by this yesterday when trying to change the cluster name -- I thought I could just change it in the cassandra.yaml and be done with it but cassandra wouldn't start because of this error. What's the process when it's not a test system (mine

Re: Heap is not released and streaming hangs at 0%

2013-06-18 Thread srmore
Thanks Rob, But then shouldn't the JVM GC it eventually? I can still see Cassandra alive and kicking, but it looks like the heap is locked up even after the traffic is long stopped. nodetool -h localhost flush didn't do much good. The version I am running is 1.0.12 (I know it's due for an upgrade but

Re: rename a cluster in cassandra 1.2.6

2013-06-18 Thread Robert Coli
On Tue, Jun 18, 2013 at 10:20 AM, Faraaz Sareshwala fsareshw...@quantcast.com wrote: Can you expand on the reasoning behind this? https://issues.apache.org/jira/browse/CASSANDRA-769 In various versions of Cassandra (including current, IIRC?) you can change the cluster name via manual edits to

Re: Heap is not released and streaming hangs at 0%

2013-06-18 Thread Robert Coli
On Tue, Jun 18, 2013 at 10:33 AM, srmore comom...@gmail.com wrote: But then shouldn't the JVM GC it eventually? I can still see Cassandra alive and kicking, but it looks like the heap is locked up even after the traffic is long stopped. No, when the GC system fails this hard it is often a permanent

Re: Data not fully replicated with 2 nodes and replication factor 2

2013-06-18 Thread Wei Zhu
Cassandra doesn't do async replication like HBase does. You can run nodetool repair to ensure consistency. Or you can increase your Read or Write consistency. As long as R + W > RF, you have strong consistency. In your case, you can use CL.TWO for either read or write. -Wei -
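The R + W > RF rule from this reply is simple arithmetic and can be checked directly. A minimal sketch, assuming R and W are the number of replicas that must acknowledge a read and a write respectively; the function name `is_strongly_consistent` is made up for illustration.

```python
def is_strongly_consistent(read_replicas: int, write_replicas: int, replication_factor: int) -> bool:
    # Reads and writes are guaranteed to overlap on at least one replica
    # only when R + W is strictly greater than RF.
    return read_replicas + write_replicas > replication_factor


RF = 2  # the replication factor from the original question

# (R, W) pairs: ONE/ONE, TWO/ONE, ONE/TWO
for r, w in [(1, 1), (2, 1), (1, 2)]:
    print(f"R={r} W={w} RF={RF} strong={is_strongly_consistent(r, w, RF)}")
```

With RF = 2, reading and writing at ONE gives 1 + 1 = 2, which is not greater than 2, so a read may miss the latest write. Raising either side to TWO gives 3 > 2, which matches the advice in the reply.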

Re: Data not fully replicated with 2 nodes and replication factor 2

2013-06-18 Thread Robert Coli
On Tue, Jun 18, 2013 at 11:36 AM, Wei Zhu wz1...@yahoo.com wrote: Cassandra doesn't do async replication like HBase does. You can run nodetool repair to ensure consistency. While this answer is true, it is somewhat non-responsive to the OP. If the OP didn't see a timeout exception, the

Re: [Cassandra] Expanding a Cassandra cluster

2013-06-18 Thread Emalayan Vairavanathan
Thank you all. I have two more questions. 1) Is there any implication in running nodetool repair immediately after bringing a new node up (before the key migration process is completed)? Will it cause some race conditions? Or will it result in some part of the space never being reclaimed?

Re: Compaction not running

2013-06-18 Thread Franc Carter
On Sat, Jun 15, 2013 at 11:49 AM, Franc Carter franc.car...@sirca.org.au wrote: On Sat, Jun 15, 2013 at 8:48 AM, Robert Coli rc...@eventbrite.com wrote: On Wed, Jun 12, 2013 at 3:26 PM, Franc Carter franc.car...@sirca.org.au wrote: We are running a test system with Leveled compaction on

Re: Compaction not running

2013-06-18 Thread Bryan Talbot
Manual compaction for LCS doesn't really do much. It certainly doesn't compact all those little files into bigger files. What makes you think that compactions are not occurring? -Bryan On Tue, Jun 18, 2013 at 3:59 PM, Franc Carter franc.car...@sirca.org.au wrote: On Sat, Jun 15, 2013 at

Re: Compaction not running

2013-06-18 Thread Franc Carter
On Wed, Jun 19, 2013 at 9:34 AM, Bryan Talbot btal...@aeriagames.com wrote: Manual compaction for LCS doesn't really do much. It certainly doesn't compact all those little files into bigger files. What makes you think that compactions are not occurring? Yeah, that's what I thought,

Unit Testing Cassandra

2013-06-18 Thread Shahab Yunus
Hello, Can anyone suggest any good/popular unit test tools/frameworks/utilities out there for unit testing Cassandra stores? I am looking for testing from a performance/load and monitoring perspective. I am using 1.2. Thanks a lot. Regards, Shahab

Re: Dropped mutation messages

2013-06-18 Thread Arthur Zubarev
Cem hi, as per http://wiki.apache.org/cassandra/FAQ#dropped_messages Internode messages which are received by a node, but do not get to be processed within rpc_timeout, are dropped rather than processed, as the coordinator node will no longer be waiting for a response. If the Coordinator