Re: reads/s suddenly dropped

2012-02-20 Thread Franc Carter
On Mon, Feb 20, 2012 at 12:00 PM, aaron morton aa...@thelastpickle.com wrote: Aside from iostats, nodetool cfstats will give you read and write latency for each CF. This is the latency for the operation on each node. Check that to see if latency is increasing. Take a look at nodetool

Final buffer length 4690 to accomodate data size of 2347 for RowMutation error caused node death

2012-02-20 Thread Thomas van Neerijnen
Hi all, I am running the Apache-packaged Cassandra 1.0.7 on Ubuntu 11.10. It has been running fine for over a month; however, yesterday I encountered the error below, which almost immediately resulted in heap usage rising quickly to almost 100% and client requests timing out on the affected node. I

is it possible to read obsolete data after write?

2012-02-20 Thread Hontvári József Levente
I noticed a strange phenomenon with Cassandra, and I would like to know whether this is something completely impossible or not. As you can see in the log extract below, as new versions of a row are written out, the reads return obsolete data after a while (they read version 78 when 79 and even 80

Re: is it possible to read obsolete data after write?

2012-02-20 Thread Hontvári József Levente
The appearance of the old rows was caused by old timestamps set on columns (which in turn was caused by some ThreadLocals that were not cleaned up). Since I fixed the timestamp, all rows returned correspond to their latest saved state in each and every case. On 2012.02.20. 13:32, Hontvári
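
To illustrate why a stale client timestamp can make an old version reappear: Cassandra keeps the column version with the highest timestamp, regardless of the order in which writes arrive. The sketch below is a simplified model, not Cassandra code; the `reconcile` helper, the values, and the timestamps are all hypothetical, and tie-breaking on equal timestamps is ignored.

```python
def reconcile(existing, incoming):
    """Return the (value, timestamp) pair that wins under
    last-write-wins: the one with the higher timestamp."""
    return incoming if incoming[1] > existing[1] else existing

# Hypothetical column versions as (value, client_timestamp).
stored = ("version 78", 2000)

# A newer write that carries a stale timestamp (for example, one cached
# in a ThreadLocal that was never cleared) loses to the stored version.
stale_write = ("version 79", 1000)
after_stale = reconcile(stored, stale_write)

# The same write with a fresh timestamp wins as expected.
fresh_write = ("version 79", 3000)
after_fresh = reconcile(stored, fresh_write)

print(after_stale)
print(after_fresh)
```

In this model the read still returns version 78 after the stale write, which matches the behaviour described in the thread once the timestamps were found to be old.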

Max TTL?

2012-02-20 Thread Erik Forsberg
Hi! When setting ttl on columns, is there a maximum value (other than MAXINT, 2**31-1) that can be used? I have a very odd behaviour here, where I try to set ttl to 9 622 973 (~111 days) which works, but setting it to 11 824 305 (~137 days) does not - it seems columns are deleted instantly
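
As a quick check of the arithmetic in the question, the sketch below converts the two TTL values from seconds to days and compares them with the 32-bit signed maximum mentioned above. It only verifies the numbers quoted in the message; it does not explain the reported behaviour, and the helper name `ttl_days` is hypothetical.

```python
SECONDS_PER_DAY = 24 * 60 * 60
MAX_INT32 = 2**31 - 1

def ttl_days(ttl_seconds):
    """Convert a TTL given in seconds to (fractional) days."""
    return ttl_seconds / SECONDS_PER_DAY

# The two TTL values quoted in the message.
for ttl in (9_622_973, 11_824_305):
    print(ttl, round(ttl_days(ttl), 1), ttl <= MAX_INT32)
```

Both values are far below 2**31-1, so an integer overflow of the TTL itself would not be expected at these sizes.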

Help needed to solve an issue with Cassandra cluster

2012-02-20 Thread Kasun Weranga
Hi All, I have set up a 3-node Cassandra cluster (Cassandra version is 0.7). In the code base I am creating a few column families using the Hector client with the default consistency level, and the replication factor of the keyspace is 2. While running the code I am getting the following exception repeatedly on

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-20 Thread Aditya Gupta
@Tamar: Thanks a ton! works perfect now! On Mon, Feb 20, 2012 at 12:02 PM, Tamar Fraenkel ta...@tok-media.com wrote: On 19/02/2012 21:55, Aditya Gupta wrote: Is there anything to do with running cassandra on a VMware ubuntu instance !? I am trying Cassandra on VMware ubuntu server instance.

Wide Row Performance Index Question

2012-02-20 Thread Blake Starkenburg
Question pertaining to wide or large rows in Cassandra. I recall reading in a blog, I believe posted by Aaron Morton, a note that Cassandra creates its own index of a row when it reaches X number of columns. My curiosity is: A.) At what column count does this happen? B.) If Thrift is only

[BETA RELEASE] Apache Cassandra 1.1.0-beta1 released

2012-02-20 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of the first beta for the future Apache Cassandra 1.1. Let me first stress that this is beta software and as such is *not* ready for production use. The goal of this release is to give a preview of what will become Cassandra 1.1 and to get

Re: Data Modeling

2012-02-20 Thread aaron morton
If you want to read all possible values for a field, where the field has 1 million possible values it's going to take time. No matter what data model you use. That said, the first model I would use is: CF: Customer Use this as a canonical record of the properties a customer has. row_key :

Re: Final buffer length 4690 to accomodate data size of 2347 for RowMutation error caused node death

2012-02-20 Thread aaron morton
Can you provide the definition for the PlayerCity CF, the client you are using and any other info about the mutation? It's a strange looking mutation. There are a lot of super columns that are not marked for delete and do not have any columns in them. The error occurred because it took 3 bytes

Re: Max TTL?

2012-02-20 Thread aaron morton
Nothing obvious. Try turning up the logging to DEBUG and watch what happens, or use sstable2json to dump the sstables. Expired columns are not purged until the sstable is compacted. Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On

Re: Help needed to solve an issue with Cassandra cluster

2012-02-20 Thread aaron morton
If you are starting out I recommend using at least 0.8; the memory management will make your life so much easier. If the schema is not fully replicated, solve that problem first. Use cassandra-cli and run the describe cluster command. If there are multiple schemas listed nuke it and start

Re: Wide Row Performance Index Question

2012-02-20 Thread aaron morton
this http://thelastpickle.com/2011/07/04/Cassandra-Query-Plans/ A.) At what column count does this happen? Based on column serialised size https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L325 B.) If Thrift is only getting slices of a large row (column_start=X,

Re: Data Modeling

2012-02-20 Thread alexis coudeyras
Thanks a lot Aaron, I will try your idea tomorrow. For CF PropertyValues, instead of property_value:customer_id should I do customer_id:property_value to preserve the same order for each property_value? (there will be custom null value). Why is using only column names faster? It seems that

unsubscribe

2012-02-20 Thread Bhupendra Babu
-- Thanks. Bhupendra B Babu

Re: Wide Row Performance Index Question

2012-02-20 Thread Todd Burruss
I believe you will see a slight imbalance regardless of your RF with very wide rows, if they are of varying sizes. One node may get a very wide row and another node may get a not so wide row. It's all based on the key. From: aaron morton aa...@thelastpickle.com
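
The point that placement is "all based on the key" can be sketched as follows. This is a simplified model of hash-based placement on a token ring, not Cassandra's implementation: the three-node ring, the row keys, the column counts, and the helpers `token` and `owner` are hypothetical, and replication is ignored. MD5 is used because RandomPartitioner hashes keys with it.

```python
import hashlib
from bisect import bisect_left

RING_SIZE = 2**128  # MD5 digests are 128 bits wide

def token(key):
    """Hash a row key to a position on the ring."""
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

def owner(key, ring):
    """Return the node whose token is the first one at or after
    the key's token, wrapping around the end of the ring."""
    tokens = [t for t, _ in ring]
    i = bisect_left(tokens, token(key)) % len(ring)
    return ring[i][1]

# Hypothetical ring: three nodes with evenly spaced tokens.
ring = [(i * RING_SIZE // 3, f"node{i}") for i in range(3)]

# Hypothetical rows: one very wide row and two narrow ones.
row_sizes = {"row_a": 1_000_000, "row_b": 10, "row_c": 10}

# Each row lands whole on one node, so column counts per node diverge.
load = {}
for key, columns in row_sizes.items():
    node = owner(key, ring)
    load[node] = load.get(node, 0) + columns

print(load)
```

Even with evenly spaced tokens, whichever node owns the key of the wide row carries almost all of the columns, because a row is never split across nodes by column.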

Re: unsubscribe

2012-02-20 Thread Eric Evans
http://goo.gl/lQJC2 On Mon, Feb 20, 2012 at 3:39 PM, Bhupendra Babu bb...@apple.com wrote: -- Thanks. Bhupendra B Babu -- Eric Evans Acunu | http://www.acunu.com | @acunu

about the deep clone operation in the Cassandra source code

2012-02-20 Thread zhangcheng
When I read the Cassandra code, I see too many deep clone operations. Is there any risk of OutOfMemoryException? When flushing the memtable to disk, there is a deep clone, and when the disk slows down, this will lead to a severe OOM. Am I right? Thanks. 2012-02-21 zhangcheng