When to use STCS/DTCS/LCS

2015-04-08 Thread Ajay
Hi, What are the guidelines on when to use STCS/DTCS/LCS?. Most preferred way to test it with each of them and find the best fit. But is there some guidelines or best practices (out of experience) which one to use when? Thanks Ajay

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Colin
Yikes, 18tb/node is a very bad idea. I dont like to go over 2-3 personally and you have to be careful with JBOD. See one of Ellis's latest posts on this and suggested use of LVM. It is a reversal on previous position re JBOD. -- Colin +1 612 859 6129 Skype colin.p.clark On Apr 8, 2015, at

Re: Availability testing of Cassandra nodes

2015-04-08 Thread Ajay
Adding Java driver forum. Even we like to know more on this. - Ajay On Wed, Apr 8, 2015 at 8:15 PM, Jack Krupansky jack.krupan...@gmail.com wrote: Just a couple of quick comments: 1. The driver is supposed to be doing availability and load balancing already. 2. If your cluster is lightly

Re: Keyspace creation taking 4-5 minutes

2015-04-08 Thread Nate McCall
Because the schema is stored in the system tables, you may be seeing some of the recent issues with compaction, particularly: https://issues.apache.org/jira/browse/CASSANDRA-8635 Upgrade to 2.1.4 and see if it helps (note: there is still https://issues.apache.org/jira/browse/CASSANDRA-8860 fixed

RE: Read Repair in cassandra

2015-04-08 Thread Jan Karlsson
The request would return with the latest data. The read request would fire against node 1 and node 3. The coordinator would get answers from both and would merge the answers and return the latest. Then read repair might run to update node 3. QUORUM does not take into consideration whether an

Re: Cluster status instability

2015-04-08 Thread Erik Forsberg
To elaborate a bit on what Marcin said: * Once a node starts to believe that a few other nodes are down, it seems to stay that way for a very long time (hours). I'm not even sure it will recover without a restart. * I've tried to stop then start gossip with nodetool on the node that thinks

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Jack Krupansky
The preferred pattern for scaling data with Cassandra is to add nodes. Growing the disk on each node is an anti-pattern. The key strength of Cassandra is that it is a DISTRIBUTED database, so always keep your eye on distributing your data. But if you do need to grow disk, be sure to grow RAM and

OrderPreservingPartitioner and compound partition key

2015-04-08 Thread Serega Sheypak
Hi imagine I have a table events with fields: ymd int user_id uuid ts timestamp attr_1 attr_2 with primary key ((ymd, user_id, ts)) and I set OrderPreservingPartitioner as a partitioner for the table ymd is int representation for the day: 20150410, 20150411, e.t.c. Can I select from table using

What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Thomas Borg Salling
I run a 10-node Cassandra cluster in production. 99% writes; 1% reads, 0% deletes. The nodes have 32 GB RAM; C* runs with 8 GB heap. Each node has a SDD for commitlog and 2x4 TB spinning disks for data (sstables). The schema uses key caching only. C* version is 2.1.2. It can be predicted that the

Availability testing of Cassandra nodes

2015-04-08 Thread Jiri Horky
Hi all, we are thinking of how to best proceed with availability testing of Cassandra nodes. It is becoming more and more apparent that it is rather complex task. We thought that we should try to read and write to each cassandra node to monitoring keyspace with a unique value with low TTL. This

Re: Availability testing of Cassandra nodes

2015-04-08 Thread Jack Krupansky
Just a couple of quick comments: 1. The driver is supposed to be doing availability and load balancing already. 2. If your cluster is lightly loaded, it isn't necessary to be so precise with load balancing. 3. If your cluster is heavily loaded, it won't help. Solution is to expand your cluster so

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Jonathan Haddad
Agreed with Jack. Cassandra is a database meant to scale horizontally by adding nodes, and what you're describing is vertical scale. Aside from the vertical scale issue, unless you're running a very specific workload (time series data w/ Date Tiered Compaction) and you REALLY know what you're

Re: OrderPreservingPartitioner and compound partition key

2015-04-08 Thread Robert Coli
On Wed, Apr 8, 2015 at 1:27 AM, Serega Sheypak serega.shey...@gmail.com wrote: and I set OrderPreservingPartitioner as a partitioner for the table As a general statement, you almost certainly do not want to use the OrderPreservingPartitioner for any purpose. It should probably be called the

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Jack Krupansky
I can certainly sympathize if you have IT staff/management who will willingly spring for some disk drives, but not for full machines, even if they are relatively commodity boxes. Seems penny-wise and pound-foolish to me, but management has their own priorities, plus there is the pre-existing

Keyspace creation taking 4-5 minutes

2015-04-08 Thread Saurabh Sethi
We have a 3 node cassandra 2.1.2 development cluster on which we frequently do a lot of keyspace creation and deletion. But after couple of days, we observed that keyspace creation is taking in order of 4-5 minutes. Anyone has any idea what might be causing it? FYI, if I run repair, cleanup

cqlsh commands for importing .CSV files into cassandra

2015-04-08 Thread Divya Divs
hi Please tell me the cqlsh commands for importing .csv file datasets into cassandra. please help to start. Iam using windows

Re: cqlsh commands for importing .CSV files into cassandra

2015-04-08 Thread Michael Dykman
http://docs.datastax.com/en/cql/3.0/cql/cql_reference/copy_r.html This only works through cqlsh. On Wed, Apr 8, 2015 at 1:48 PM, Divya Divs divya.divi2...@gmail.com wrote: hi Please tell me the cqlsh commands for importing .csv file datasets into cassandra. please help to start. Iam using

Re: What's to think of when increasing disk size on Cassandra nodes?

2015-04-08 Thread Nate McCall
First off, I agree that the preferred path is adding nodes, but it is possible. Can C* handle up to 18 TB data size per node with this amount of RAM? Depends on how deep in the weeds you want to get tuning and testing. See below. Is it feasible to increase the disk size by mounting a new