Re: Ec2 Stress Results

2011-05-06 Thread Alex Araujo
months ago I was seeing 12k writes/s on a single EC2 XL. So something is wrong. My first suspicion is that your client node may be the bottleneck. On Wed, Apr 20, 2011 at 2:56 PM, Alex Araujo cassandra-us...@alex.otherinbox.com mailto:cassandra-us...@alex.otherinbox.com wrote

Re: Ec2 Stress Results

2011-05-09 Thread Alex Araujo
On 5/6/11 9:47 PM, Jonathan Ellis wrote: On Fri, May 6, 2011 at 5:13 PM, Alex Araujo cassandra-us...@alex.otherinbox.com wrote: I raised the default MAX_HEAP setting from the AMI to 12GB (~80% of available memory). This is going to make GC pauses larger for no good reason. Good point - only

Re: Data types for cross language access

2011-05-11 Thread Alex Araujo
On 5/11/11 5:27 AM, Oliver Dungey wrote: I am currently working on a system with Cassandra that is written purely in Java. I know our end solution will require other languages to access the data in Cassandra (Python, C++ etc.). What is the best way to store data to ensure I can do this? Should

Re: Ec2 Stress Results

2011-05-11 Thread Alex Araujo
On 5/9/11 9:49 PM, Jonathan Ellis wrote: On Mon, May 9, 2011 at 5:58 PM, Alex Araujocassandra- How many replicas are you writing? Replication factor is 3. So you're actually spot on the predicted numbers: you're pushing 20k*3=60k raw rows/s across your 4 machines. You might get another 10%

Re: Ec2 Stress Results

2011-05-11 Thread Alex Araujo
Hey Adrian - Why did you choose four big instances rather than more smaller ones? Mostly to see the impact of additional CPUs on a write only load. The portion of the application we're migrating from MySQL is very write intensive. The other 8 core option was c1.xl with 7GB of RAM. I will

Re: How to invoke getNaturalEndpoints with jconsole?

2011-05-13 Thread Alex Araujo
On 5/13/11 10:08 AM, Maki Watanabe wrote: I wrote a small JMX client to invoke getNaturalEndpoints. It works fine at my test environment, but throws NPE for keyspace we will use for our application (both 0.7.5). Does anyone know quick resolution of that before I setting up cassandra on eclipse

Atomicity Strategies

2011-04-08 Thread Alex Araujo
Hi, I was wondering if there are any patterns/best practices for creating atomic units of work when dealing with several column families and their inverted indices. For example, if I have Users and Groups column families and did something like: Users.insert( user_id, columns )

Re: Atomicity Strategies

2011-04-08 Thread Alex Araujo
, Alex Araujo wrote: Hi, I was wondering if there are any patterns/best practices for creating atomic units of work when dealing with several column families and their inverted indices. For example, if I have Users and Groups column families and did something like: Users.insert( user_id, columns

Ec2 Stress Results

2011-04-20 Thread Alex Araujo
Does anyone have any Ec2 benchmarks/experiences they can share? I am trying to get a sense for what to expect from a production cluster on Ec2 so that I can compare my application's performance against a sane baseline. What I have done so far is: 1. Lunched a 4 node cluster of m1.xlarge