Re: Latest Stable version of cassandra in production

2014-01-09 Thread Rahul Menon
You should refer to this https://engineering.eventbrite.com/what-version-of-cassandra-should-i-run/ Thanks Rahul On Thu, Jan 9, 2014 at 8:06 AM, Sanjeeth Kumar sanje...@exotel.in wrote: Hi all, What is the latest stable version of cassandra you have in production ? We are migrating a

Re: Latest Stable version of cassandra in production

2014-01-09 Thread Jan Algermissen
On 09.01.2014, at 03:36, Sanjeeth Kumar sanje...@exotel.in wrote: Hi all, What is the latest stable version of cassandra you have in production ? We are migrating a large chunk of our mysql database to cassandra. I see a lot of discussions regarding 1.* versions, but I have not seen /

Re: Latest Stable version of cassandra in production

2014-01-09 Thread Laing, Michael
I would like to +1 Jan. We are using C* 2.0 and have just gone into production directly supporting the latest revision of www.nytimes.com. I avoid new features unless I really need them; we are prepared to read code and make fixes ourselves if necessary, but it has not been. Best regards,

Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
Hi all, I have a use case with huge data which i am not able to design in cassandra. Table name : MetricResult Sample Data : Metric=Sales, Time=Month, Period=Jan-10, Tag=U.S.A, Tag=Pen, Value=10 Metric=Sales, Time=Month, Period=Jan-10, Tag=U.S.A, Tag=Pencil, Value=20 Metric=Sales,

Moving data from 1.0.9 cluster to a 2.0.* cluster

2014-01-09 Thread Or Sher
Hi all, I want to use sstableloader in order to load 1.0.9 data to a 2.0.* cluster. I know that the sstable format is incompatible between the two versions. What are my options? Is there a tool to upgrade sstables directly without any real nodes involvement? -- Or Sher

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Thunder Stumpges
This sort of work sounds much more like a Hadoop/Hive/Pig type of analysis. What are your latency requirements on queries? Are they ad-hoc or part of an application? What is the case where you would need to change an existing value? If it is write once, then Hadoop/Hive is great, if it changes

Unable to handshake with earlier version

2014-01-09 Thread Elliot Sumner
I seem to be having a similar issue to https://www.mail-archive.com/user@cassandra.apache.org/msg33340.html Basically I have a new node of version 1.2.13, when attempting to connect to the ring it fails to handshake with other nodes in the cluster. This is attempting to connect to a 1.1.12

What are the Pros and Cons to upgrade Cassandra from 1.2.10 to 2.0.X ?

2014-01-09 Thread Miguel Angel Martin junquera
Hi all: We have a Cassandra 1.2.10 pre-production ring integrated with Hadoop and Pig and we are thinking to upgrade the system from 1.2.10 to 2.0.X ( to 2.0.4 for example) before we pass to production. What are the Pros and Cons According to your experience? Many thanks and Regards

Re: Latest Stable version of cassandra in production

2014-01-09 Thread Bruce Durling
So, what are you getting from 2.0 if you aren't using the new features? Why not stick with 1.2.x? cheers, Bruce On Thu, Jan 9, 2014 at 12:37 PM, Laing, Michael michael.la...@nytimes.com wrote: I would like to +1 Jan. We are using C* 2.0 and have just gone into production directly supporting

Re: What are the Pros and Cons to upgrade Cassandra from 1.2.10 to 2.0.X ?

2014-01-09 Thread Robert Coli
On Thu, Jan 9, 2014 at 8:46 AM, Miguel Angel Martin junquera mianmarjun.mailingl...@gmail.com wrote: We have a Cassandra 1.2.10 pre-production ring integrated with Hadoop and Pig and we are thinking to upgrade the system from 1.2.10 to 2.0.X ( to 2.0.4 for example) before we pass to

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
@thunder thanks for guidance queries will be fired by application on this table when users login and browse the application and also through mobile apps through webservice. Response needs to be quick as user will be doing analysis over this data on the fly. Writes also needs to be fast as there is

Re: Unable to handshake with earlier version

2014-01-09 Thread Robert Coli
On Thu, Jan 9, 2014 at 8:33 AM, Elliot Sumner elli...@playdemand.comwrote: I seem to be having a similar issue to https://www.mail-archive.com/user@cassandra.apache.org/msg33340.html Basically I have a new node of version 1.2.13, when attempting to connect to the ring it fails to handshake

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
@thunder It will be write once 80% of time but there can be cases client makes correction in data and then we need to overwrite that.. Thanks Naresh On Thu, Jan 9, 2014 at 11:49 PM, Naresh Yadav nyadav@gmail.com wrote: @thunder thanks for guidance queries will be fired by application

Re: Moving data from 1.0.9 cluster to a 2.0.* cluster

2014-01-09 Thread Robert Coli
On Thu, Jan 9, 2014 at 6:54 AM, Or Sher or.sh...@gmail.com wrote: I want to use sstableloader in order to load 1.0.9 data to a 2.0.* cluster. I know that the sstable format is incompatible between the two versions. What are my options?

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Hannu Kröger
To my eye that looks something what the traditional analytics systems do. You can check out e.g. Acunu Analytics which uses Cassandra as a backend. Cheers, Hannu 2014/1/9 Naresh Yadav nyadav@gmail.com Hi all, I have a use case with huge data which i am not able to design in cassandra.

unsubscribe

2014-01-09 Thread Earl Ruby
unsubscribe

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Thunder Stumpges
Well I think you have essentially time-series data, which C* should handle well, however I think your Tag column is going to cause troubles. C* does have collection columns, but they are not indexable nor usable in WHERE clause. Your example has both the uniqueness of the data (primary key) and

Re: unsubscribe

2014-01-09 Thread Dave Brosius
just send that email to user-unsubscribe@cassandra.apache.orgif still confused check here http://hadonejob.com/img/full/12598654.jpg - Original Message -From: quot;Earl Rubyquot; ;er...@webcdr.com

Optimal Way to Tune For Searchs For Missing Keys

2014-01-09 Thread Charlie Mason
Hi All, Does anyone have any suggestions about how to improve performance in the below use case? I have a very simple table with a single Partition Key, and one Cluster key. My app is periodically writing new entries in the table and deleting old ones. There are a lot more reads than writes on

Re: What are the Pros and Cons to upgrade Cassandra from 1.2.10 to 2.0.X ?

2014-01-09 Thread Thunder Stumpges
I don't have specific experience upgrading from 1.x to 2.x but I do have to say that if you have Pig/Hadoop integration working, go with that. I ran into many small issues getting the integration working with just the right version of Hadoop/Pig/and Cassandra 2.0.x On Jan 9, 2014, at 8:46 AM,

Re: Latest Stable version of cassandra in production

2014-01-09 Thread Laing, Michael
Good: doesn't OOM on smallish machines, can use defaults for almost all params w good results. Bad: watch the list like a hawk to avoid problems others have, be aware of bug fixes, workarounds, and Jira issues. ml On Thu, Jan 9, 2014 at 12:58 PM, Bruce Durling b...@otfrom.com wrote: So, what

nodetool flush usage

2014-01-09 Thread Christopher J. Bottaro
Am I correct in understanding that it needs to be run on each node in the cluster? For example, if I have a three node cluster, I'd have to run: nodetool -h node-1 flush nodetool -h node-2 flush nodetool -h node-3 flush ? Also, does block until it's done? My use case is recreating a keyspace.

Re: Latest Stable version of cassandra in production

2014-01-09 Thread Bruce Durling
Michael, On Thu, Jan 9, 2014 at 10:15 PM, Laing, Michael michael.la...@nytimes.com wrote: Good: doesn't OOM on smallish machines, can use defaults for almost all params w good results. Bad: watch the list like a hawk to avoid problems others have, be aware of bug fixes, workarounds, and Jira

Re: Optimal Way to Tune For Searchs For Missing Keys

2014-01-09 Thread Robert Coli
On Thu, Jan 9, 2014 at 1:42 PM, Charlie Mason charlie@gmail.com wrote: There are a lot more reads than writes on this particular table. All of the queries are just for the partition key. Most of the queries are for partition keys that don't exists, more than 99% of the queries. Reads for

Re: Struggling to understand CFS and its use.

2014-01-09 Thread Ben Coverston
+1 to what Ed said. CFS is a good facilitator for running MR jobs on Cassandra to fill the HDFS requirement (you just want to run MR, but you don't want the whole Hadoop stack). The source data for your MR jobs should be in Cassandra KS/CFs. On Mon, Nov 18, 2013 at 3:21 PM, Edward Capriolo

Re: Nodetool ring

2014-01-09 Thread Aaron Morton
Owns is how much of the entire, cluster wide, data set the node has. In both your examples every node has a full copy of the data. If you have 6 nodes and RF 3 they would have 50%. Cheers - Aaron Morton New Zealand @aaronmorton Co-Founder Principal Consultant Apache

Re: Cassandra consuming too much memory in ubuntu as compared to within windows, same machine.

2014-01-09 Thread Aaron Morton
When Xms and Xmx are the same like this the JVM allocates all the memory, and then on Linux cassandra will ask the OS to lock that memory so it cannot be paged out. On windows it’s probably getting paged out. If you only have 4GB on the box, you probably do not want to run cassandra with 4GB.

Re: massive spikes in read latency

2014-01-09 Thread Aaron Morton
The spikes in latency don’t seem to be correlated to an increase in reads. The cluster’s workload is usually handling a maximum workload of 4200 reads/sec per node, with writes being significantly less, at ~200/sec per node. Usually it will be fine with this, with read latencies at around

Re: nodetool cleanup / TTL

2014-01-09 Thread Aaron Morton
Is there some other mechanism for forcing expired data to be removed without also compacting? (major compaction having obvious problematic side effects, and user defined compaction being significant work to script up). Tombstone compactions may help here

Re: offheap component

2014-01-09 Thread Robert Coli
On Thu, Jan 9, 2014 at 5:02 PM, Aaron Morton aa...@thelastpickle.comwrote: Row cache has been off heap for a while, and the key cache is still on heap. Usage of the off heap row cache has enough associated heap consumption that I feel compelled to mention that the name off heap is slightly

Re: upgrade from cassandra 1.2.3 - 1.2.13 + start using SSL

2014-01-09 Thread Aaron Morton
We avoid mixing versions for a long time, but we always upgrade one node and check the application is happy before proceeding. e.g. wait for 30 minutes before upgrading the others. If you snapshot before upgrading, and have to roll back after 30 minutes you can roll back to the snapshot and

Re: What are the Pros and Cons to upgrade Cassandra from 1.2.10 to 2.0.X ?

2014-01-09 Thread Jacob Rhoden
On 10 Jan 2014, at 3:46 am, Miguel Angel Martin junquera mianmarjun.mailingl...@gmail.com wrote: We have a Cassandra 1.2.10 pre-production ring integrated with Hadoop and Pig and we are thinking to upgrade the system from 1.2.10 to 2.0.X ( to 2.0.4 for example) before we pass to

Can't start service with error: java.lang.IllegalStateException: Unable to contact any seeds

2014-01-09 Thread Francisco Dalla Rosa Soares
Hello everyone, I've tried to google all I could and also asking at ServerFault first but as I got no answer I decided to come to the list. I have 6 machines that I want to use to make a cluster using Cassandra 2.0 Cassandra start in the machines, however it dies after a while with the error

Re: Help on Designing Cassandra table for my usecase

2014-01-09 Thread Naresh Yadav
@Thunder thanks for suggesting design but my main problem is indexing/quering dynamic Tag on each row that is main context of each row and most of queries will include that.. As an alternative to cassandra, i tried Apache Blur, in blur table i am able to store exact same data and all queries also