Re: Cassandra disk space utilization WAY higher than I would expect

2010-07-07 Thread Jordan Pittier - Rezel
I see the same thing here. I have tried to do some maths including timestamps, column names, keys and raw data, but in the end Cassandra reports a cluster size 2 to 3 times bigger than the raw data. I am surely missing something in my formula + I have a lot of free hard drive space, so it's

Re: Uneven distribution using RP

2010-06-20 Thread Jordan Pittier - Rezel
Hi, Have you tried nodetool repair (or cleanup) on your nodes? On Sun, Jun 20, 2010 at 4:16 PM, James Golick jamesgol...@gmail.com wrote: I just increased my cluster from 2 to 4 nodes, and RF=2 to RF=3, using RP. The tokens seem pretty even on the ring, but two of the nodes are far more

Re: Uneven distribution using RP

2010-06-20 Thread Jordan Pittier - Rezel
looked roughly even after that, but a couple of days later, it's looking pretty uneven. On Sun, Jun 20, 2010 at 10:21 AM, Jordan Pittier - Rezel jor...@rezel.net wrote: Hi, Have you tried nodetool repair (or cleanup) on your nodes? On Sun, Jun 20, 2010 at 4:16 PM, James Golick jamesgol

Re: Running a very small cluster

2010-06-09 Thread Jordan Pittier - Rezel
Hi, Regarding point c), you should ask yourself: what is good performance for me? Read performance mainly depends on how fast your hard drives are and how many rows you can keep in cache. With such a small cluster, if you want good read performance, you had better have fast hard drives and

Re: what/how do you guys monitor slow nodes?

2010-05-11 Thread Jordan Pittier - Rezel
For sure you have to pay particular attention to memory allocation on each node; above all, be sure your servers don't swap. Then you can monitor how load is balanced among your nodes (nodetool -h XX ring). On Tue, May 11, 2010 at 11:46 PM, S Ahmed sahmed1...@gmail.com wrote: If you have 3-4

Re: performance tuning - where does the slowness come from?

2010-05-04 Thread Jordan Pittier
I'm facing the same issue with swap. It only occurs when I perform read operations (writes are very fast :)). So I can't help you with the memory problem. But to balance the load evenly between the nodes in a cluster, just set their tokens manually (the formula is i * 2^127 / nb_nodes). Jordan On Tue,
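The token formula quoted above, i * 2^127 / nb_nodes, can be sketched in a few lines of Python. This is only an illustration: the function name `initial_tokens` is made up here, and it assumes the nodes are numbered i = 0 .. nb_nodes - 1 and that integer division is acceptable for the token value.

```python
def initial_tokens(nb_nodes):
    """Return evenly spaced tokens, one per node, using i * 2^127 / nb_nodes.

    Assumption: nodes are numbered i = 0 .. nb_nodes - 1.
    Integer arithmetic is used so that large values are not rounded
    the way floating-point numbers would be.
    """
    if nb_nodes < 1:
        raise ValueError("nb_nodes must be at least 1")
    return [i * 2**127 // nb_nodes for i in range(nb_nodes)]


if __name__ == "__main__":
    # Print one token per node for a hypothetical 4-node cluster.
    for node, token in enumerate(initial_tokens(4)):
        print(node, token)
```

Each printed value would then be assigned by hand to the matching node, as the message suggests.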

Re: Re: why the sum of all the nodes' loads is much bigger than the size of the inserted data?

2010-04-30 Thread Jordan Pittier
Don't forget to count timestamps for each column. 2010/4/30 Bingbing Liu rucb...@gmail.com hi, thanks for your help. I ran nodetool -h compact but the load stays the same; can anyone tell me why? 2010-04-30 -- Bingbing Liu
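As a rough illustration of counting timestamps, here is a back-of-the-envelope estimate in Python. It is a sketch under stated assumptions, not how Cassandra itself computes load: the function name and the data layout are hypothetical, each column is assumed to carry one 8-byte (64-bit) timestamp, and the sum of node loads is assumed to include every replica. It ignores serialization overhead, index and bloom filter files, and SSTables that have not been compacted yet, so it should be read as a lower bound.

```python
TIMESTAMP_BYTES = 8  # assumption: one 64-bit timestamp stored per column


def estimate_payload_bytes(rows, replication_factor=1):
    """Lower-bound estimate of stored bytes across the whole cluster.

    rows maps a row key (bytes) to a dict of column name (bytes) -> value (bytes).
    Only keys, column names, values and timestamps are counted.
    """
    total = 0
    for key, columns in rows.items():
        total += len(key)
        for name, value in columns.items():
            total += len(name) + len(value) + TIMESTAMP_BYTES
    # Every replica holds its own copy of the data.
    return total * replication_factor


if __name__ == "__main__":
    sample = {b"user1": {b"name": b"Jordan", b"age": b"30"}}
    print(estimate_payload_bytes(sample, replication_factor=3))
```

With many small columns, the timestamps and column names can easily outweigh the values themselves, which is one reason the reported load exceeds the size of the inserted data.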

Re: batch_mutate - PHP

2010-04-27 Thread Jordan Pittier
Hi, Here is a working example: $mutation_map = array($key => array('Standard1' => array())); for ($column_name = 0; $column_name < $options['numcolumns']; $column_name++) { $column = new cassandra_Column(array('name' => $column_name, 'value' => 'put your data here', 'timestamp' => time()));

Re: [RELEASE] 0.6.0

2010-04-13 Thread Jordan Pittier
For those who can't wait: http://perso.rezel.net/cassandra_0.6.0-1_all.deb md5sum is 6dd71e18e1e0239e50302098d395536e Based on https://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.6.0/ On Tue, Apr 13, 2010 at 7:43 PM, Ned Wolpert ned.wolp...@imemories.com wrote: Is 0.6.0 a repackage of

Re: compare cassandra read n write results

2010-04-12 Thread Jordan Pittier
First, read carefully and understand: http://wiki.apache.org/cassandra/ThriftExamples#PHP But you really shouldn't bother with benchmarks. Ask yourself this question: what if my Cassandra performs at 5k operations/s? And what about 3k op/s? In other words, why are you benchmarking? You've got