record not available on created system when other system (Node\Seed) is shutdown

2011-09-12 Thread RAJASHEKAR REDDY
Hi,   I Installed Cassandra 0.8.4 on two systems configure like below   System 1: IP 10.1.1.1 which is acting as seed   - seeds: 10.1.1.1   listen_address: 10.1.1.1   rpc_address: 10.1.1.1      System 2: IP 10.1.1.2 which is acting as node    - seeds: 10.1.1.1  listen_address: 10.1.1.2  

memtable flush thresholds

2011-09-12 Thread Sorin Julean
Hi, I've checked the memtable flush (cassandra 0.8.4) and it seams to me it hapens sooner then the threshold is reached. Here's the threshould's (the default ones calculated for a heap size of -Xmx1980M): ColumnFamily: idx_graphable (Super) Key Validation Class:

Index search in provided list of rows (list of rowKeys).

2011-09-12 Thread Evgeniy Ryabitskiy
Hi, We have an issue to search over Cassandra and we are using Sphinx for indexing. Because of Sphinx architecture we can't use range queries over all fields that we need to. So we have to run Sphinx Query first to get List of rowKeys and perform additional range filtering over column values.

Re: memtable flush thresholds

2011-09-12 Thread Jonathan Ellis
see memtable_total_space_in_mb at http://thelastpickle.com/2011/05/04/How-are-Memtables-measured/ On Mon, Sep 12, 2011 at 6:55 AM, Sorin Julean sorin.jul...@gmail.com wrote: Hi,  I've checked the memtable flush (cassandra 0.8.4)  and it seams to me it hapens sooner then the threshold is

Re: memtable flush thresholds

2011-09-12 Thread Sorin Julean
Thanks Jonathan ! memtable_total_space_in_mb is the threshold that is reached. Kind regards, Sorin On Mon, Sep 12, 2011 at 3:16 PM, Jonathan Ellis jbel...@gmail.com wrote: see memtable_total_space_in_mb at http://thelastpickle.com/2011/05/04/How-are-Memtables-measured/ On Mon, Sep 12,

Replace Live Node

2011-09-12 Thread Kyle Gibson
Version=0.7.8 I have a 3 node cluster with RF=3, how would I move data from a live node to a replacement node? I tried an autobootstrap + decomission, but I got this error on the live node: Exception in thread main java.lang.IllegalStateException: replication factor (3) exceeds number of

Re: Not all data structures need timestamps (and don't require wasted memory).

2011-09-12 Thread David Jeske
On Sat, Sep 3, 2011 at 8:26 PM, Kevin Burton bur...@spinn3r.com wrote: The point is that replication in Cassandra only needs timestamps to handle out of order writes … for values that are idempotent, this isn't necessary. The order doesn't matter. I believe this is a mis-understanding of

Re: Replace Live Node

2011-09-12 Thread Jeremy Hanna
Yeah - I would bootstrap at initial_token of -1 the current one. Then once that has bootstrapped, then decommission the old one. Avoid trying to use removetoken on anything before 0.8.3. Use decommission if you can if you're dealing with a live node. On Sep 12, 2011, at 10:42 AM, Kyle

Re: what's the difference between repair CF separately and repair the entire node?

2011-09-12 Thread Peter Schuller
I am using 0.7.4.  so it is always okay to do the routine repair on Column Family basis? thanks! It's okay but won't do what you want; due to a bug you'll see streaming of data for other column families than the one you're trying to repair. This will be fixed in 1.0. -- / Peter Schuller

Re: Not all data structures need timestamps (and don't require wasted memory).

2011-09-12 Thread David Jeske
After writing my message, I recognized a scenerio you might be referring to Kevin. If I understand correctly, you're not referring to set-membership in the general sense, where one could add and remove entries. General set-membership, in the context of eventual-consistency, requires timestamps.

AntiEntropyService.getNeighbors pulls information from where?

2011-09-12 Thread Sasha Dolgy
This relates to the issue i opened the other day: https://issues.apache.org/jira/browse/CASSANDRA-3175 .. basically, 'nodetool ring' throws an exception on two of the four nodes. In my fancy little world, the problems appear to be related to one of the nodes thinking that someone is their

Re: Replace Live Node

2011-09-12 Thread Kyle Gibson
What could you do if the initial_token is 0? On Mon, Sep 12, 2011 at 1:09 PM, Jeremy Hanna jeremy.hanna1...@gmail.com wrote: Yeah - I would bootstrap at initial_token of -1 the current one.  Then once that has bootstrapped, then decommission the old one.  Avoid trying to use removetoken on

Re: Replace Live Node

2011-09-12 Thread Konstantin Naryshkin
The ring wraps around, so the value before 0 is the max possible token. I believe that it is 2**127 -1 . - Original Message - From: Kyle Gibson kyle.gib...@frozenonline.com To: user@cassandra.apache.org Sent: Monday, September 12, 2011 3:30:20 PM Subject: Re: Replace Live Node What

Re: Replace Live Node

2011-09-12 Thread Jeremy Hanna
I believe you'd need 2^127 - 1, which is 170141183460469231731687303715884105727 On Sep 12, 2011, at 2:30 PM, Kyle Gibson wrote: What could you do if the initial_token is 0? On Mon, Sep 12, 2011 at 1:09 PM, Jeremy Hanna jeremy.hanna1...@gmail.com wrote: Yeah - I would bootstrap at

cleanup / move

2011-09-12 Thread David McNelis
While it would certainly be preferable to not run a cleanup and a move at the same time on the same node, is there a techincal problem with running a nodetool move on a node while a cleanup is running? Or if its possible to gracefully kill a cleanup, so that a move can be run and then cleanup

Re: Replace Live Node

2011-09-12 Thread Kyle Gibson
So to move data from node with token 0, the new node needs to have initial token set to 170141183460469231731687303715884105727 ? Another idea: could I move token to 1, and then use token 0 on the new node? On Mon, Sep 12, 2011 at 3:38 PM, Jeremy Hanna jeremy.hanna1...@gmail.com wrote: I

Re: Replace Live Node

2011-09-12 Thread Jeremy Hanna
So to move data from node with token 0, the new node needs to have initial token set to 170141183460469231731687303715884105727 ? I would do this route. Another idea: could I move token to 1, and then use token 0 on the new node? nodetool move prior to 0.8 is a very heavy operation.

balancing issue with Random partitioner

2011-09-12 Thread David McNelis
We are running the datastax .8 rpm distro. We have a situation where we have 4 nodes and each owns 25% of the keys. However the last node in the ring does not seem to be getting much of a load at all. We are using the random partitioner, we have a total of about 20k keys that are sequential...

Re: Index search in provided list of rows (list of rowKeys).

2011-09-12 Thread aaron morton
Just checking, you want an API call like this ? multiget_filtered_slice(keys, column_parent, predicate, filter_clause, consistency_level) Where filter_clause is an IndexClause. It's a bit messy. is there no way to express this as a single get_indexed_slice() call? With a == index

Re: balancing issue with Random partitioner

2011-09-12 Thread Jonathan Ellis
Looks kind of like the 4th node was added to the cluster w/o bootstrapping. On Mon, Sep 12, 2011 at 3:59 PM, David McNelis dmcne...@agentisenergy.com wrote: We are running the datastax .8 rpm distro.  We have a situation where we have 4 nodes and each owns 25% of the keys.  However the last

Re: balancing issue with Random partitioner

2011-09-12 Thread David McNelis
Auto-bootstrapping is turned on and the node had been started several hours ago. Since the node already shows up as part of the ring I would imagine that nodetool join wouldn't do anything.Is there a command to jumpstart bootstrapping? On Mon, Sep 12, 2011 at 4:22 PM, Jonathan Ellis

Re: what's the difference between repair CF separately and repair the entire node?

2011-09-12 Thread Jim Ancona
On Mon, Sep 12, 2011 at 1:44 PM, Peter Schuller peter.schul...@infidyne.com wrote: I am using 0.7.4.  so it is always okay to do the routine repair on Column Family basis? thanks! It's okay but won't do what you want; due to a bug you'll see streaming of data for other column families than

Re: Index search in provided list of rows (list of rowKeys).

2011-09-12 Thread Evgeniy Ryabitskiy
Something like this. Actually I think it's better to extend get_indexed_slice() API instead of creating new one thrift method. I wish to have something like this: //here we run query to external search engine Listbyte[] keys = performSphinxQuery(someFullTextSearchQuery); IndexClause indexClause

Cassandra performance on a virtual network....

2011-09-12 Thread Chris Marino
Hello everyone, I wanted to tell you about some performance benchmarking we have done with Cassandra running in EC2 on a virtual network. The purpose of the experiment was to see how running Cassandra on a virtual network could simplify operational complexity and to determine the performance

Re: AntiEntropyService.getNeighbors pulls information from where?

2011-09-12 Thread aaron morton
I'm pretty sure I'm behind on how to deal with this problem. Best I know is to start the node with -Dcassandra.load_ring_state=false as a JVM option. But if the ghost IP address is in gossip it will not work, and it should be in gossip. Does the ghost IP show up in nodetool ring ? Anyone

Re: cleanup / move

2011-09-12 Thread aaron morton
is there a techincal problem with running a nodetool move on a node while a cleanup is running? Cleanup is removing data that the node is no longer responsible for while move is first removing *all* data from the node and then streaming new data to it. I'd put that in the crossing the

Re: balancing issue with Random partitioner

2011-09-12 Thread aaron morton
Try a reapir on 100.5 , it will then request the data from the existing nodes. You will then need to clean on the existing three nodes once the repair has completed. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 13/09/2011,

Re: AntiEntropyService.getNeighbors pulls information from where?

2011-09-12 Thread Sasha Dolgy
use system; del LocationInfo[52696e67]; i ran this on the nodes that had the problems. stopped, started the nodes, it re-did it's job job done. all fixed with a new bug! https://issues.apache.org/jira/browse/CASSANDRA-3186 On Tue, Sep 13, 2011 at 2:09 AM, aaron morton

Re: what's the difference between repair CF separately and repair the entire node?

2011-09-12 Thread Yan Chunlu
I think it is a serious problem since I can not repair. I am using cassandra on production servers. is there some way to fix it without upgrade? I heard of that 0.8.x is still not quite ready in production environment. thanks! On Tue, Sep 13, 2011 at 1:44 AM, Peter Schuller

Re: Cassandra -f problem

2011-09-12 Thread Hernán Quevedo
Hi, Roshan. This is great support, amazing support; not used to it :) Thanks for the reply. Well I think java is installed correctly, I mean, the java -version command works on a terminal, so the PATH env variable is correctly set, right? I downloaded the JDK7 and put it on opt/java/ and then

Re: Cassandra -f problem

2011-09-12 Thread Roshan Dawrani
Hi, Do you have JAVA_HOME exported? If not, can you export it and retry? Cheers. On Tue, Sep 13, 2011 at 8:59 AM, Hernán Quevedo alexandros.c@gmail.comwrote: Hi, Roshan. This is great support, amazing support; not used to it :) Thanks for the reply. Well I think java is installed