Re: Hector has a website

2011-10-07 Thread Radim Kolar
I have a question for the HOM FAQ: http://rantav.github.com/hector/build/html/content/HOM/hector-object-mapper.html#hom-faq Please write a short example of how to persist objects in batch.

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread Radim Kolar
On 16.9.2011 8:20, Yang wrote: I looked at the JMX attributes CFS.BloomFilterFalseRatio, it's 1.0, BloomFilterFalsePositives, it's 2810. Is it possible to query this bloom filter false ratio from the command line?
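A minimal sketch of how the two JMX values quoted above might relate. It assumes the false ratio is defined as false positives divided by all positive bloom filter answers (false plus true); that definition, the function name, and the true-positive count of 0 are assumptions for illustration and are not taken from the thread.

```python
def bloom_filter_false_ratio(false_positives: int, true_positives: int) -> float:
    """Fraction of 'maybe present' bloom filter answers that were wrong.

    Assumed definition: false_positives / (false_positives + true_positives).
    """
    total = false_positives + true_positives
    if total == 0:
        # No positive answers recorded yet, so there is nothing to be wrong about.
        return 0.0
    return false_positives / total


# 2810 is the BloomFilterFalsePositives value from the message; the
# true-positive count of 0 is hypothetical, chosen so the ratio comes out at 1.0.
print(bloom_filter_false_ratio(2810, 0))
```

Under this assumed definition, a ratio of 1.0 would mean that every positive answer recorded so far was a false positive.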

Re: Hector has a website

2011-10-07 Thread aaron morton
Thanks, will be handy for new peeps. A - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 7/10/2011, at 12:00 PM, Patricio Echagüe wrote: Hi, I wanted to let you all know that Hector client has a website. http://hector-client.org

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread aaron morton
Off the top of my head, I think it's not exposed via nodetool. You can get it via HTTP if you install mx4j, or you could try http://wiki.cyclopsgroup.org/jmxterm Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 7/10/2011, at 8:09 PM,

ebs or ephemeral

2011-10-07 Thread Madalina Matei
Hi, I'm looking to deploy a 5-node cluster in EC2 with RF3 and QUORUM CL. Could you please advise me on EBS vs ephemeral storage? Cheers, Madalina

Re: ebs or ephemeral

2011-10-07 Thread Yi Yang
Obviously ephemeral. It has higher IO availability, will not affect your Ethernet IO performance, is free (included in the instance price), and the redundancy is provided by Cassandra itself. Sent from my BlackBerry® wireless device -Original Message- From: Madalina Matei madalinaima...@gmail.com Date:

Re: ebs or ephemeral

2011-10-07 Thread aaron morton
DataStax has prebuilt AMIs here http://www.datastax.com/dev/blog/setting-up-a-cassandra-cluster-with-the-datastax-ami and an explanation of why we normally avoid ephemeral. Also, I would go with 6 nodes. You will then be able to handle up to 2 failed nodes. Hope that helps.

Re: ebs or ephemeral

2011-10-07 Thread Madalina Matei
Hi Aaron, For a 6-node cluster, what RF can we use in order to support 2 failed nodes? From the article that you sent I understood that we should avoid EBS and use ephemeral. Am I missing anything? Thank you so much for your help, Madalina On Fri, Oct 7, 2011 at 9:15 AM, aaron morton
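A minimal sketch of the arithmetic behind the RF question, assuming QUORUM is a majority of the replicas, floor(RF / 2) + 1. The function names are hypothetical. Note that the result counts down replicas of a single row, which is not the same as arbitrary failed nodes in the cluster.

```python
def quorum(rf: int) -> int:
    """Number of replicas that must respond at QUORUM (a majority of RF)."""
    return rf // 2 + 1


def tolerated_replica_failures(rf: int) -> int:
    """Replicas of one row that can be down while QUORUM still succeeds."""
    return rf - quorum(rf)


for rf in (3, 5):
    print(f"RF={rf}: quorum={quorum(rf)}, "
          f"replicas that may be down={tolerated_replica_failures(rf)}")
```

Whether two failed nodes can be tolerated at a given RF therefore depends on whether both nodes hold replicas of the same rows.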

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread Radim Kolar
On 7.10.2011 10:04, aaron morton wrote: Off the top of my head, I think it's not exposed via nodetool. You can get it via HTTP if you install mx4j, or you could try http://wiki.cyclopsgroup.org/jmxterm I have MX4J/HTTP but can't find that info in the listing. I suspect that bloom filter

Re: Batch writes getting slow

2011-10-07 Thread Stefan Reek
Is it actually filling up enough to trigger an old-gen CMS gc? Yes, it fills up to the 16G and then it starts doing the CMS gc's which dramatically decreases the performance. I'm still not sure why it does this, as a nodetool info states the load as less than 4G. Any ideas? On 10/06/2011

Re: Batch writes getting slow

2011-10-07 Thread Jonathan Ellis
Look at the OOM section in http://www.datastax.com/docs/0.6/troubleshooting/index On Fri, Oct 7, 2011 at 3:47 AM, Stefan Reek ste...@unitedgames.com wrote: Is it actually filling up enough to trigger an old-gen CMS gc? Yes, it fills up to the 16G and then it starts doing the CMS gc's which

Immutable CFs and read consistency

2011-10-07 Thread Anthony Wilcox
We have a Column Family that is immutable (no updates after the first write). Suppose we use RF=2 and W=1. Do we still need R=2, so that R+W>RF? Or is it sufficient to have R=1? My guess is yes, R=1 is sufficient since if it reads a row, the row has to be correct: it can't get stale data.

RE: Immutable CFs and read consistency

2011-10-07 Thread Anthony Wilcox
The last sentence should have been: "In other words, does R apply also to (possibly failed) read attempts or only to successful reads?" Anthony From: peacepatr...@hotmail.com To: user@cassandra.apache.org Subject: Immutable CFs and read consistency Date: Fri, 7 Oct 2011 06:21:04 -0700 We

RE: Hinted handoff question

2011-10-07 Thread Rene Kochen
If I trigger hint delivery using JMX, it works. I see in the log: 2011-10-07 15:17:51,216 INFO 15:17:51,216 Started hinted handoff for endpoint /172.16.108.19 However, it does not work automatically (i.e. after node 3 is up again). I tested it with Cassandra 0.8.6 and there it works as

RE: Immutable CFs and read consistency

2011-10-07 Thread Dan Hendry
The R+W > RF requirement for strong consistency applies regardless of whether your data is 'immutable' or is being updated. A W=1, R=1 approach will not guarantee consistency between reads and writes. With R=1, Cassandra might look on one of the two nodes, find no data there, and prematurely give
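A minimal sketch of why R+W > RF matters, using brute force over replica sets. It assumes a write is acknowledged by exactly W replicas and a read consults exactly R replicas; the function name is hypothetical, and the model ignores hinted handoff and read repair.

```python
from itertools import combinations


def read_always_sees_write(rf: int, w: int, r: int) -> bool:
    """True if every possible set of R read replicas shares at least one
    replica with every possible set of W replicas that acknowledged a write."""
    replicas = range(rf)
    return all(
        set(write_set) & set(read_set)
        for write_set in combinations(replicas, w)
        for read_set in combinations(replicas, r)
    )


# The case from the thread: RF=2, W=1.
for r in (1, 2):
    print(f"RF=2 W=1 R={r}: overlap guaranteed = {read_always_sees_write(2, 1, r)}")
```

With RF=2 and W=1, a read at R=1 can land on the replica that has not yet received the row, so the row may appear to be missing even though it is never updated.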

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread Mohit Anchlia
Check your disk utilization using iostat. Also, check if compactions are causing reads to be slow. Check GC too. You can look at cfhistograms output or post it here. On Fri, Oct 7, 2011 at 1:44 AM, Radim Kolar h...@sendmail.cz wrote: On 7.10.2011 10:04, aaron morton wrote: Off the top of

Re: For multi-tenant, is it good to have a key space for each tenant?

2011-10-07 Thread David McNelis
Reviving this thread... Say you want to enable potentially thousands of tenants with their own sets of column families? In this situation a keyspace for each tenant wouldn't work it would seem...what issues would we likely see if we were to build out thousands of column families (where a column

Re: Hinted handoff question

2011-10-07 Thread Jonathan Ellis
This was fixed in 0.7.8 by CASSANDRA-2928 In general it's a good idea to upgrade to the latest stable release before spending a lot of time debugging :) On Fri, Oct 7, 2011 at 8:33 AM, Rene Kochen rene.koc...@emea.schange.com wrote: If I trigger hint delivery using JMX, it works. I see in the

Re: For multi-tenant, is it good to have a key space for each tenant?

2011-10-07 Thread Jonathan Ellis
On Fri, Oct 7, 2011 at 9:36 AM, David McNelis dmcne...@agentisenergy.com wrote: In some documentation I've read it says that keyspaces take up the majority of the resources. This has never been the case. In a couple of older threads they talked about getting the number of column families

Re: For multi-tenant, is it good to have a key space for each tenant?

2011-10-07 Thread David McNelis
So at the end of the day it's going to be limited by available memory then? Going by this line: Do note that a minimum of 1MB per memtable is used by the per-memtable arena allocator (https://issues.apache.org/jira/browse/CASSANDRA-2252) also introduced in 1.0, which is worth keeping in mind if you
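A minimal sketch of the memory floor implied by the quoted line, assuming one live memtable per column family and taking the 1MB minimum at face value. The function name and the column family counts are hypothetical; actual memory use depends on much more than this minimum.

```python
MIN_ARENA_MB_PER_MEMTABLE = 1  # the 1MB per-memtable minimum quoted in the thread


def min_arena_mb(column_families: int) -> int:
    """Lower bound on arena memory in MB, assuming one memtable per column family."""
    return column_families * MIN_ARENA_MB_PER_MEMTABLE


# Hypothetical column family counts, for illustration only.
for cf_count in (100, 1000, 5000):
    print(f"{cf_count} column families -> at least {min_arena_mb(cf_count)} MB")
```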

Re: For multi-tenant, is it good to have a key space for each tenant?

2011-10-07 Thread Edward Capriolo
On Fri, Oct 7, 2011 at 10:47 AM, David McNelis dmcne...@agentisenergy.com wrote: So at the end of the day it's going to be limited by available memory then? Going by this line: Do note that a minimum of 1MB per memtable is used by the per-memtable arena allocator

RE: Hinted handoff question

2011-10-07 Thread Rene Kochen
I missed that one in the release notes. Thank you! -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent: Friday, 7 October 2011 16:39 To: user@cassandra.apache.org Subject: Re: Hinted handoff question This was fixed in 0.7.8 by CASSANDRA-2928 In general it's a good

Re: GC for ParNew on 0.8.6

2011-10-07 Thread Richard Low
Hi Philippe, On Thu, Sep 29, 2011 at 6:47 AM, Philippe watche...@gmail.com wrote: No it was an upgrade from 0.8.4 or 0.8.5 depending on the nodes. No cassandra-env files were changed during the update. Any other ideas? The cluster has just been weird ever since running 0.8.6: has anyone

Re: 0.7.9 RejectedExecutionException

2011-10-07 Thread Ashley Martens
Okay, this is still a problem. This node keeps dying at 1am every day, most times without an error in the log. I'd appreciate any help in tracking down why. Additionally, I don't understand why 0.7.x is using *way* more RAM than 0.6.x and 0.8.x, from a top or ps perspective. I'm now watching the

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread Radim Kolar
On 7.10.2011 15:55, Mohit Anchlia wrote: Check your disk utilization using iostat. Also, check if compactions are causing reads to be slow. Check GC too. You can look at cfhistograms output or post it here. I don't know how to interpret cfhistograms. Can you write it up on the wiki?

Re: how to reduce disk read? (and bloom filter performance)

2011-10-07 Thread Mohit Anchlia
You'll see output like:
Offset  SSTables
1       8021
2       783
Which means 783 read operations accessed 2 SSTables. On Fri, Oct 7, 2011 at 2:03 PM, Radim Kolar h...@sendmail.cz wrote: On 7.10.2011 15:55, Mohit Anchlia wrote: Check your disk utilization using
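A minimal sketch of reading the Offset/SSTables pairs quoted above. It assumes the input contains only those two columns, with a two-word header followed by integer pairs; real cfhistograms output has more columns, and the function name is hypothetical.

```python
def parse_sstable_histogram(text: str) -> dict:
    """Map 'SSTables touched per read' (Offset) to the number of reads.

    Assumes two header tokens followed by whitespace-separated integer pairs.
    """
    tokens = text.split()
    numbers = [int(token) for token in tokens[2:]]  # skip "Offset" and "SSTables"
    # Even positions are offsets, odd positions are the matching read counts.
    return dict(zip(numbers[0::2], numbers[1::2]))


sample = "Offset SSTables 1 8021 2 783"  # the values quoted in the message
histogram = parse_sstable_histogram(sample)
total_reads = sum(histogram.values())

for sstables, reads in sorted(histogram.items()):
    share = reads / total_reads
    print(f"{reads} reads accessed {sstables} SSTable(s) ({share:.1%} of reads)")
```

Reads that touch more SSTables generally need more disk seeks, so a histogram weighted toward higher offsets is one thing to look for when investigating slow reads.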