Cassandra data distribution and configuration settings

2009-11-17 Thread Jon Graham
Hello Everyone, I am running a 10 node Cassandra cluster using a recent Cassandra trunk. After inserting a million data records using keys of 128 character lower case hex digits and 1024 data values, I noticed that almost all of the data is accumulated on only 3 nodes. I am using the

Re: Cassandra data distribution and configuration settings

2009-11-17 Thread Jon Graham
at 9:19 AM, TuxRacer69 tuxrace...@gmail.com wrote: Jon, what is your key distribution? Did you set InitialToken/ on each node according to this distribution? TR Jon Graham wrote: Hello Everyone, I am running a 10 node Cassandra cluster using a recent Cassandra trunk. After
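
Editor's sketch for the question above about setting an initial token per node to match the key distribution. It assumes an order-preserving partitioner that compares keys as strings, and keys spread uniformly over lower case hex digits; the snippet is cut off before either point is confirmed. The function name and the 4-digit token width are illustrative choices, not anything from the thread or from Cassandra itself.

```python
def evenly_spaced_hex_tokens(node_count, digits=4):
    """Return node_count hex-string tokens that split the hex key space evenly.

    Assumption: keys are lower case hex strings with a uniform distribution,
    so a short hex prefix is enough to mark each node's range boundary.
    """
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    space = 16 ** digits  # number of distinct hex prefixes of this width
    return [format(i * space // node_count, "0{}x".format(digits))
            for i in range(node_count)]


if __name__ == "__main__":
    # One candidate token per node for a 10 node ring.
    for node, token in enumerate(evenly_spaced_hex_tokens(10)):
        print(node, token)
```

Because '0'-'9' sort before 'a'-'f', lower case hex strings of equal width sort in the same order as the numbers they encode, which is why evenly spaced numeric boundaries give evenly spaced string boundaries here. If the real keys are not uniform, the boundaries would need to follow the observed distribution instead.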

Cassandra backup and restore procedures

2009-11-18 Thread Jon Graham
Hello Everyone, Is there a recommended backup/restore procedure to be able to recover a failed node? How does Cassandra keep track of a node's identity? Should a replacement node keep the same IP address/DNS name as the original node? Does a node still receive data while a nodeprobe snapshot

Re: Cassandra backup and restore procedures

2009-11-18 Thread Jon Graham
determined by the position in the cluster ring? Thanks, Jon On Wed, Nov 18, 2009 at 10:30 AM, Jonathan Ellis jbel...@gmail.com wrote: On Wed, Nov 18, 2009 at 12:05 PM, Jon Graham sjclou...@gmail.com wrote: Hello Everyone, Is there a recommended backup/restore procedure to be able

Cassandra Database Format Compatibility

2009-11-23 Thread Jon Graham
Hello Everyone, Will the Cassandra database format for the current Cassandra source trunk be compatible with the 0.5 Cassandra release? If there are database version differences, is there a migration path to convert older data formats to the new versions? Is there an estimated release date for

Adjusting Token Spaces and Rebalancing Data

2010-02-24 Thread Jon Graham
Hello, I have a 6-node Cassandra 0.5.0 cluster using org.apache.cassandra.dht.OrderPreservingPartitioner with replication factor 3. I mistakenly set my tokens to the wrong values, and have all the data being stored on the first node (with replicas on the second and third nodes). Does Cassandra

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
to find the nodetool command for Cassandra 0.5.0. Is this a separate package/tool? Thanks, Jon On Wed, Feb 24, 2010 at 8:17 PM, Jonathan Ellis jbel...@gmail.com wrote: nodeprobe loadbalance and/or nodeprobe move http://wiki.apache.org/cassandra/Operations On Wed, Feb 24, 2010 at 6:17 PM, Jon

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
...@gmail.com wrote: nodetool is the 0.6 replacement for nodeprobe. the stream info is new in that version. (0.6 beta release is linked from http://wiki.apache.org/cassandra/GettingStarted) -Jonathan On Mon, Mar 1, 2010 at 12:40 PM, Jon Graham sjclou...@gmail.com wrote: Hello Everyone

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
token space distribution. Thanks for your help, Jon On Mon, Mar 1, 2010 at 11:55 AM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Mar 1, 2010 at 1:44 PM, Jon Graham sjclou...@gmail.com wrote: Can I tell if the load balancing operation is still running ok or if it has terminated

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-01 Thread Jon Graham
? Thanks, Jon On Mon, Mar 1, 2010 at 2:54 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Mar 1, 2010 at 3:18 PM, Jon Graham sjclou...@gmail.com wrote: Thanks Jonathan. It seems like the load balance operation isn't moving. I haven't seen any data file time changes in 2 hours

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-02 Thread Jon Graham
, Jon On Mon, Mar 1, 2010 at 4:55 PM, Jonathan Ellis jbel...@gmail.com wrote: On Mon, Mar 1, 2010 at 5:39 PM, Jon Graham sjclou...@gmail.com wrote: Reached an EOL or something bizzare occured. Reading from: /192.168.2.13 BufferSizeRemaining: 16 This one is harmless java.io.IOException: Value

Re: Adjusting Token Spaces and Rebalancing Data

2010-03-02 Thread Jon Graham
by 795: http://bugs.sun.com/bugdatabase/view_bug.do?bug_id=6253145. It is marked fixed in JDK 6u18, so try upgrading to that. -Jonathan On Tue, Mar 2, 2010 at 10:46 AM, Jon Graham sjclou...@gmail.com wrote: Hello, I am running a 32-bit linux version 2.6.27.24. My original data set

Cassandra 0.5.1 get_key_range problem

2010-03-12 Thread Jon Graham
Hello, When using the get_key_range method with ConsistencyLevel.ONE an entire block of keys is not returned. I loop over the get_key_range method, advancing the start key after each call (requesting 8K keys per call). When running the program several times, I got the same results with large key
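
Editor's sketch of the paging loop described above: call a range query repeatedly, advancing the start key after each call. The `fetch_range(start, count)` callable is a hypothetical stand-in for the real get_key_range call, and the sketch assumes the start key is inclusive, so each page after the first repeats the last key of the previous page. It illustrates the loop shape only and does not diagnose the missing block of keys reported in the message.

```python
import bisect


def iterate_keys(fetch_range, page_size=8192):
    """Yield every key in order by paging through fetch_range(start, count).

    Assumption: fetch_range returns up to `count` sorted keys >= start.
    """
    if page_size < 2:
        # With an inclusive start key, a page of 1 could never advance.
        raise ValueError("page_size must be at least 2")
    start, last = "", None
    while True:
        page = fetch_range(start, page_size)
        # Drop the key already yielded at the end of the previous page.
        if last is not None and page and page[0] == last:
            page = page[1:]
        if not page:
            return
        for key in page:
            yield key
        last = start = page[-1]


def make_fake_fetch(sorted_keys):
    """In-memory stand-in for a cluster, for trying the loop locally."""
    def fetch(start, count):
        i = bisect.bisect_left(sorted_keys, start)
        return sorted_keys[i:i + count]
    return fetch


if __name__ == "__main__":
    keys = ["%04x" % i for i in range(100)]
    seen = list(iterate_keys(make_fake_fetch(keys), page_size=8))
    print(len(seen), seen == keys)
```

Comparing the count of keys yielded against the number inserted, as the final lines do against the in-memory list, is one way to confirm whether a block of keys is being skipped between pages.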