R: Re: Schema clone ...

2012-01-10 Thread cbert...@libero.it
* Grab the system sstables from one of the 0.7 nodes and spin up a temp 1.0 machine them, then use the command. Probably I'm still sleeping but I can't get what I want! :-( I've copied the SSTables of a node to my own computer where I installed a Cassandra 1.0 just for the purpose.

AW: How to control location of data?

2012-01-10 Thread Roland Gude
Hi, i think everything is called a replica so if data is on 3 nodes you have 3 replicas. There is no such thing as an original. A partitioner decides into which partition a piece of data belongs A replica placement strategy decides which partition goes on which node You cannot suppress the

Re: AW: How to control location of data?

2012-01-10 Thread Andreas Rudolph
Hi! Thank you for your last reply. I'm still wondering if I got you right... ... A partitioner decides into which partition a piece of data belongs Does your statement imply that the partitioner does not take any decisions at all on the (physical) storage location? Or put another way: What

R: Re: AW: How to control location of data?

2012-01-10 Thread cbert...@libero.it
In each node of the ring has a unique Token which representing the node's logical position in the cluster. When you perform an operation on a row is calculated a token based on this row ... the node-token closest to the row-token will store the data (and also the RF-1 remaining nodes) -- this

DataStax offers two new advanced Apache Cassandra classes (San Mateo)

2012-01-10 Thread Lynn Bender
DataStax is now offering two advanced classes for Apache Cassandra Tuesday, February 14 - Wednesday February 15 - San Mateo Advanced Modeling and Analytics with Apache Cassandra Additional details and RSVP at Eventbrite: http://cassandra-modeling-sanmateo.eventbrite.com/ Thursday, February 16 -

AW: AW: How to control location of data?

2012-01-10 Thread Roland Gude
Each node in the cluster is assigned a token (can be done automatically - but usually should not) The token of a node is the start token of the partition it is responsible for (and the token of the next node is the end token of the current tokens partition) Assume you have the following

Announcing Countandra 0.5

2012-01-10 Thread Milind Parikh
Inspired by twitter's rainbird project, Countandra is a hierarchical distributed counting engine at scale. It provides a complete http based interface to both posting events and getting queries. The syntax of a event posting is done in a FORMS compatible way. The result of the query is emitted in

Re: Schema clone ...

2012-01-10 Thread aaron morton
* Grab the system sstables from one of the 0.7 nodes and spin up a temp 1.0 machine them, then use the command. Grab the *system* tables Migrations , Schema etc. in cassandra/data/system Cheers - Aaron Morton Freelance Developer @aaronmorton

Re: How to control location of data?

2012-01-10 Thread aaron morton
What we want is to partition the cluster with respect to key spaces. Why do you want to do this ? (It's probably a bad idea) Background in here on the partitioner, placement strategy and the snitch http://thelastpickle.com/2011/02/07/Introduction-to-Cassandra/ Now here's how to do it…. Use

Re: Should I throttle deletes?

2012-01-10 Thread Maxim Potekhin
Thanks, this makes sense. I'll try that. Maxim On 1/6/2012 10:51 AM, Vitalii Tymchyshyn wrote: Do you mean on writes? Yes, your timeouts must be so that your write batch could complete until timeout elapsed. But this will lower write load, so reads should not timeout. Best regards, Vitalii

Syncing across environments

2012-01-10 Thread David McNelis
Is anyone familiar with any tools that are already available to allow for configurable synchronization of different clusters? Specifically for purposes of development, i.e. Dev, staging, test, and production cassandra environments, so that you can easily plug in the information that you want to

Example Hadoop JobConf for reading from Cassandra?

2012-01-10 Thread Kevin Burton
I'm trying to port the Hadoop InputFormat to Peregrine (another map reduce impl I'm working on) … http://peregrine_mapreduce.bitbucket.org/ The problem is that I can't get it to work with my config because the documentation is a bit sparse. I could probably spend a ton of time tracking this

Re: AW: How to control location of data?

2012-01-10 Thread Maki Watanabe
Small correction: The token range for each node is (Previous_token, My_Token]. ( means exclusive and ] means inclusive. So N1 is responsible from X+1 to A in following case. maki 2012/1/11 Roland Gude roland.g...@yoochoose.com: Each node in the cluster is assigned a token (can be done