Re: Cassandra driver performance question...

2013-06-24 Thread Jabbar Azam
Hello tony, I couldnt reply earlier because I've been decorating over the weekend so have been a bit busy. Let me know what's happens. Out of couriosity why are you using and not a cql3 native driver? Thanks Jabbar Azam On 24 Jun 2013 00:32, Tony Anecito adanec...@yahoo.com wrote: Hi

Re: Updated sstable size for LCS, ran upgradesstables, file sizes didn't change

2013-06-24 Thread Hiller, Dean
We would be very very interested in your results. We currently run 10M but have heard of 256M sizes as well. Please let us know what you find out. Thanks, Dean From: Andrew Bialecki andrew.biale...@gmail.commailto:andrew.biale...@gmail.com Reply-To:

AssertionError: Unknown keyspace?

2013-06-24 Thread Hiller, Dean
I haven't seen this error in a long time. We just received the below error in production when rebuilding a node…any ideas on how to get around this? We had rebuilt 3 other nodes already I think(we have been swapping hardware) ERROR 06:32:21,474 Exception in thread Thread[ReadStage:1,5,main]

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Hiller, Dean
Ah, so digging deeper, it is not bootstrapping. How do I force the node to bootstrap? (this is version 1.2.2 and the other nodes somehow knew to bootstrap automatically but this one I need to force for some reason). I remember there was a property for this. NOTE: I enabled some debug logs and

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Hiller, Dean
Oh shoot, this is a seed node. Is there documentation on how to bootstrap a seed node? If I have seeds of A, B, C for every machine on the ring and I am bootstrapping node B, do I just modify cassandra.yaml and remove node B from the yaml file temporarily and boot it up (Note, I still received

quick question on seed nodes configuration

2013-06-24 Thread Hiller, Dean
For ease of use, we actually had a single cassandra.yaml deployed to every machine and a script that swapped out the token and listen address. I had seed nodes ip1,ip2,ip3 as the seeds but what I didn't realize was then that these nodes had themselves as seeds. I am assuming that should never

Re: Cassandra driver performance question...

2013-06-24 Thread Tony Anecito
Hi Jabbar,   I am using JDBC driver because almost no examples exist about what you mention. Even most of the JDBC examples I find do not work because they are incomplete or out of date. If you have a good reference about what you mentioned I can try it.   As I menioned I got selects to work

Re: quick question on seed nodes configuration

2013-06-24 Thread julien Campan
Hi , The seeds are only used when a node appears in the cluster. At this moment it chooses a seed (in the same dc) in order to have some information. So, the most secure way is to write all your other nodes as seed, but in fact you need only one up. if you think that you will never have 3 node

Re: Cassandra driver performance question...

2013-06-24 Thread Jabbar Azam
Hello Tony, This came out recently http://www.datastax.com/doc-source/developer/java-driver/index.html I can't vouch for performance but the documentation is ok and it works. I'm using it on a side project myself. There is also astyanax by netflix and it also supports CQL 3

Hadoop/Cassandra 1.2 timeouts

2013-06-24 Thread Brian Jeltema
I'm having problems with Hadoop job failures on a Cassandra 1.2 cluster due to Caused by: TimedOutException() 2013-06-24 11:29:11,953 INFO Driver -at org.apache.cassandra.thrift.Cassandra$get_range_slices_result.read(Cassandra.java:12932) This is running on a 6-node cluster,

Re: Upgrade from 1.1.10 to 1.2.4

2013-06-24 Thread Robert Coli
On Sun, Jun 23, 2013 at 2:31 AM, Ananth Gundabattula agundabatt...@threatmetrix.com wrote: Looks like the cause of the error was because of not specifying num_tokens in the cassandra.yaml file. I was under the impression that setting a value of num_tokens will override the initial_token value .

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Robert Coli
On Mon, Jun 24, 2013 at 6:04 AM, Hiller, Dean dean.hil...@nrel.gov wrote: Oh shoot, this is a seed node. Is there documentation on how to bootstrap a seed node? If I have seeds of A, B, C for every machine on the ring and I am bootstrapping node B, do I just modify cassandra.yaml and remove

Re: Upgrade from 1.1.10 to 1.2.4

2013-06-24 Thread Ananth Gundabattula
Hello Rob, I ran into the stack trace when the situation was : num_tokens unset ( by this I mean not specifying anything ) and initial_token set to some value. I was initially under the impression that specifying num_tokens will over ride the initial_token value and hence left num_tokens blank.

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Wei Zhu
I have got bitten by it once. At least there should be a message saying, there is no streaming data since it's a seed node.  I searched the source code, the message was there and it got removed at certain version. -Wei  From: Robert Coli rc...@eventbrite.com

sorting columns by time

2013-06-24 Thread Bill Hastings
Hi All I have a requirement where I need to have my columns sorted by the creation time. However I would like to have my own naming scheme for the columns and not use TimeUUID as column names. Please advice as to how I can achieve this in Cassandra as this has been pretty confusing to me.

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Hiller, Dean
Yes, it would be nice at startup just to say don't list your seed node as this node and then fail out and we would have known this a long long time ago ;). Dean From: Wei Zhu wz1...@yahoo.commailto:wz1...@yahoo.com Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org

Re: sorting columns by time

2013-06-24 Thread Hiller, Dean
Send the naming scheme you desire. Is long time since epoch ok? Or a composite name of time since epoch + (something else) Dean From: Bill Hastings bllhasti...@gmail.commailto:bllhasti...@gmail.com Reply-To: user@cassandra.apache.orgmailto:user@cassandra.apache.org

Re: AssertionError: Unknown keyspace?

2013-06-24 Thread Wei Zhu
Here is the line in the source code for 1.1.0: https://github.com/apache/cassandra/blob/cassandra-1.1.0/src/java/org/apache/cassandra/service/StorageService.java#L549 And it's refactored later to this, and the message was removed.

Re: [Cassandra] Running node tool cleanup

2013-06-24 Thread Emalayan Vairavanathan
Thank you Robert and others for answering my questions. I started to play with nodetool  and I have few more questions. Does nodetool cleanup run synchronously or asynchronously ? If it is running asynchronously is there any way to monitor the progress ? Thank you Emalayan

Re: CAS and long lived locks

2013-06-24 Thread sankalp kohli
Assuming that database migration is a one time and rare operation, why don't you try to grab a lock for a short time. If you are able to grab it, then you can renew it for a longer time. This will make sure that in case of collision, all contenders wont be locked out for long time. You can use

Re: CAS and long lived locks

2013-06-24 Thread sankalp kohli
Also CAS is in 2.0 which is not production ready so I am not sure how you will use it. On Mon, Jun 24, 2013 at 4:35 PM, sankalp kohli kohlisank...@gmail.comwrote: Assuming that database migration is a one time and rare operation, why don't you try to grab a lock for a short time. If you are

Re: Date range queries

2013-06-24 Thread Christopher J. Bottaro
Yes, that makes sense and that article helped a lot, but I still have a few questions... The created_at in our answers table is basically used as a version id. When a user updates his answer, we don't overwrite the old answer, but rather insert a new answer with a more recent timestamp (the

Re: Cassandra terminates with OutOfMemory (OOM) error

2013-06-24 Thread Mohammed Guller
No deletes. In my test, I am just writing and reading data. There is a lot of GC, but only on the younger generation. Cassandra terminates before the GC for old generation kicks in. I know that our queries are reading an unusual amount of data. However, I expected it to throw a timeout

Counter value becomes incorrect after several dozen reads writes

2013-06-24 Thread Josh Dzielak
I have a loop that reads a counter, increments it by some integer, then goes off and does about 500ms of other work. After about 10 iterations of this loop, the counter value *sometimes* appears to be corrupted. Looking at the logs, a sequence that just happened is: Read counter - 15000

Re: CAS and long lived locks

2013-06-24 Thread Blair Zajac
I normally have migrations run at server startup and depending upon the complexity, they could run for a while if they need to do per-row data transformations. I don't get the point regarding collisions, somebody is going to be locked out for a while, so getting the lock for a short period

Re: CAS and long lived locks

2013-06-24 Thread Blair Zajac
Our product is in development now so we don't plan on going into production later when 2.0.0 is out. Blair On 06/24/2013 01:36 PM, sankalp kohli wrote: Also CAS is in 2.0 which is not production ready so I am not sure how you will use it. On Mon, Jun 24, 2013 at 4:35 PM, sankalp kohli

Re: Counter value becomes incorrect after several dozen reads writes

2013-06-24 Thread Arthur Zubarev
Hi Josh, are you looking at the read counter produced by cfstats? If so it is not for a CF, but the entire KS and not tied to a specific operation, but rather per the entire lifetime of JVM. Just in case, some supporting info:

How to do a CAS UPDATE on single column CF?

2013-06-24 Thread Blair Zajac
How does one do an atomic update in a column family with a single column? I have a this CF CREATE TABLE schema_migrations ( version TEXT PRIMARY KEY, ) WITH COMPACTION = {'class': 'LeveledCompactionStrategy'}; that records which database migrations have been applied. I want to do a

Mixing CAS UPDATE and non-CAS DELETE

2013-06-24 Thread Blair Zajac
Looking at the CAS unit tests [1], if one does a CAS UPDATE to create a ROW: UPDATE test SET v1 = 2, v2 = 'foo' WHERE k = 0 IF NOT EXISTS there isn't a CAS DELETE FROM that only uses the partition key. You can do this to delete the row using CAS: DELETE FROM test WHERE k = 0 IF v1 =

Re: How to do a CAS UPDATE on single column CF?

2013-06-24 Thread Arthur Zubarev
On 06/24/2013 11:23 PM, Blair Zajac wrote: CAS UPDATE Since when C* has IF NOT EXISTS in DML part of CQL? -- Regards, Arthur

copy data between clusters

2013-06-24 Thread S C
I have a scenario here. I have a cluster A and cluster B running on cassandra 1.1. I need to copy data from Cluster A to Cluster B. Cluster A has few keyspaces that I need to copy over to Cluster B. What are my options? Thanks,SC

Re: copy data between clusters

2013-06-24 Thread Arthur Zubarev
On 06/24/2013 11:35 PM, S C wrote: I have a scenario here. I have a cluster A and cluster B running on cassandra 1.1. I need to copy data from Cluster A to Cluster B. Cluster A has few keyspaces that I need to copy over to Cluster B. What are my options? Thanks, SC I am thinking of

Re: NREL has released open source Databus on github for time series data

2013-06-24 Thread aaron morton
Hi Dean, Does this handle rollup aggregates along with the time series data ? I had a quick look at the links and could not see anything. Cheers Aaron - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On

Re: [Cassandra] Replacing a cassandra node with one of the same IP

2013-06-24 Thread aaron morton
so I am just wondering if this means the hinted handoffs are also updated to reflect the new Cassandra node uuid. Without checking the code I would guess not. Because it would involve a potentially large read / write / delete to create a new row with the same data. And Hinted Handoff is an

Re: Counter value becomes incorrect after several dozen reads writes

2013-06-24 Thread Josh Dzielak
Hi Arthur, This is actually for a column in a counter column family, i.e. CounterColumnType. Will check out that thread though, thanks. Best, Josh -- Josh Dzielak VP Engineering • Keen IO Twitter • @dzello (https://twitter.com/dzello) Mobile • 773-540-5264 On Monday, June 24, 2013 at