Re: 15 seconds to increment 17k keys?

2011-09-06 Thread Oleg Anastastasyev
in the family. There are millions of rows. Each operation consists of doing a batch_insert through pycassa, which increments ~17k keys. A majority of these keys are new in each batch. Each operation is taking up to 15 seconds. For our system this is a significant bottleneck. Try to

Re: Using 5-6 bytes for cassandra timestamps vs 8…

2011-09-06 Thread Oleg Anastastasyev
I have a patch for trunk which I just have to get time to test a bit before I submit. It is for super columns and will use the super columns timestamp as the base and only store variant encoded offsets in the underlying columns.  Could you please measure how much real benefit it brings (in

[RELEASE] Apache Cassandra 0.7.9

2011-09-06 Thread Eric Evans
I'm pleased to announce the release of Cassandra 0.7.9. 0.7.9 contains a number of important bug-fixes (full list here[1]), and should be an easy upgrade from previous 0.7 releases. Source and binary distributions are available from the Downloads page[3], and users of Debian and derivative

Re: Why no need to query all nodes on secondary index lookup?

2011-09-06 Thread Kaj Magnus Lindberg
Hi Martin Yes that was helpful, thanks (I had no idea you were reading the Cassandra users list! :-) ) Thanks, (Kaj) Magnus L On Mon, Sep 5, 2011 at 10:57 PM, Martin von Zweigbergk martin.von.zweigbe...@gmail.com wrote: Hi Magnus, I think the answer might be on

CQL and schema-less column family

2011-09-06 Thread osishkin osishkin
Sorry for the newbie question but I failed to find a clear answer. Can CQL be used to query a schema-less column family? can they be indexed? That is, query for column names that do not necessarily exist in all rows, and were not defined in advance when the column family was created. Thank you

Reg Cassandra load balance

2011-09-06 Thread Thamizh
Hi All, I am using Cassandra-0.7.8 on cluster of 4 machines. I have uploaded some files using Map/Reduce. It looks files got distributed only among 2 nodes. When I used RF=3 it had got distributed to equally 4 nodes on below configuration. Here are some config info's: 1.

Re: Reg Cassandra load balance

2011-09-06 Thread Radim Kolar
switch to random (hash) partitioner OR move tokens from your empty nodes into different position in ring; split your full nodes in half. ring will then look like: owns 14% 14% 14% rest of ring.

new to github: Casbase: distributed secondary indexes for Cassandra

2011-09-06 Thread Edward Capriolo
https://github.com/edwardcapriolo/casbase What is it? There are many great articles about building secondary Cassandra indexes such as http://www.anuff.com/2011/02/indexing-in-cassandra.html. In a nutshell, index building boils down to turning a single insert into multiple inserts to support

Re: commodity server spec

2011-09-06 Thread China Stoffen
In general, more smaller is better than fewer big. Probably go for what's cost-effective. Cost effective solution is few and fat servers because it also saves hosting cost. The exception to that would be if you're truly only caring about writes and have *very* few reads that are not latency

Professional Support

2011-09-06 Thread China Stoffen
There is a link to a page which lists few professional support providers on Cassandra homepage. I have contacted few of them and couple are just out of providing support and others didn't reply. So, do you know about any professional support provider for Cassandra solutions and how much they

Re: Professional Support

2011-09-06 Thread Jim Ancona
We use Datastax (http://www.datastax.com) and we have been very happy with the support we've received. We haven't tried any of the other providers on that page, so I can't comment on them. Jim (Disclaimer: no connection with Datastax other than as a satisfied customer.) On Tue, Sep 6, 2011 at

Re: Professional Support

2011-09-06 Thread William Oberman
I also have used datastax with great success (same disclaimer). A specific example: -I setup a one-on-one call to talk through an issue, in my case a server reconfiguration. It took 2 days to find a time to meet, though that was my fault as I believe they could have worked me in within a day. I

Re: Professional Support

2011-09-06 Thread China Stoffen
Thanks for sharing the info.. though I contacted datastax using contact form but no reply yet after more than a week. Probably I need to contact Ben directly. From: Ben Coverston ben.covers...@datastax.com To: user@cassandra.apache.org Sent: Tuesday, September

Calculate number of nodes required based on data

2011-09-06 Thread Hefeng Yuan
Hi, Is there any suggested way of calculating number of nodes needed based on data? We currently have 6 nodes (each has 8G memory) with RF5 (because we want to be able to survive loss of 2 nodes). The flush of memtable happens around every 30 min (while not doing compaction), with ~9m

Re: commodity server spec

2011-09-06 Thread Bill
Mongodb, last time I looked does not scale horizontally. I've seen reasonable behavour putting Cassandra database tables onto remote filers, but you absolutely have to test against the SAN configuration and carefully manage things like concurrent reader/writer settings, the fs and cassandra

Re: Cassandra client loses connectivity to cluster

2011-09-06 Thread Jim Ancona
Since we finally fixed this issue, I thought I'd document the solution, with the hope that it makes it easier for others who might run into it. During the time this issue was occurring Anthony Ikeda reported a very similar issue, although without the strange pattern of occurrences we saw:

Re: Cassandra client loses connectivity to cluster

2011-09-06 Thread Jonathan Ellis
Thanks for the followup, Jim! We'll review https://issues.apache.org/jira/browse/CASSANDRA-3142 shortly. On Tue, Sep 6, 2011 at 2:58 PM, Jim Ancona j...@anconafamily.com wrote: Since we finally fixed this issue, I thought I'd document the solution, with the hope that it makes it easier for

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Anthony Ikeda
Jonathan, do you know when 0.8.5 will be released? We are looking at a production deployment soon and this fix is something that we would need. Alternatively, what is the stability of the trunk for a production deployment. Anthony On Mon, Sep 5, 2011 at 3:35 PM, Evgeniy Ryabitskiy

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Jonathan Ellis
0.8.5 is being voted on now on the dev list. I'd encourage you to test it. I do not recommend running trunk. On Tue, Sep 6, 2011 at 5:32 PM, Anthony Ikeda anthony.ikeda@gmail.com wrote: Jonathan, do you know when 0.8.5 will be released? We are looking at a production deployment soon and

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Anthony Ikeda
Thanks Jonathan, I'll consult with the team. Anthony On Tue, Sep 6, 2011 at 3:34 PM, Jonathan Ellis jbel...@gmail.com wrote: 0.8.5 is being voted on now on the dev list. I'd encourage you to test it. I do not recommend running trunk. On Tue, Sep 6, 2011 at 5:32 PM, Anthony Ikeda

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Anthony Ikeda
Do you have a link to the downloadable? Anthony On Tue, Sep 6, 2011 at 3:38 PM, Anthony Ikeda anthony.ikeda@gmail.comwrote: Thanks Jonathan, I'll consult with the team. Anthony On Tue, Sep 6, 2011 at 3:34 PM, Jonathan Ellis jbel...@gmail.com wrote: 0.8.5 is being voted on now on

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Jonathan Ellis
It's linked from the vote thread: http://mail-archives.apache.org/mod_mbox/cassandra-dev/201109.mbox/%3ccakkz8q12k2o7zm5uy9hxnk7kyesqidwcyxbq_uzfna+yaty...@mail.gmail.com%3E On Tue, Sep 6, 2011 at 5:41 PM, Anthony Ikeda anthony.ikeda@gmail.com wrote: Do you have a link to the downloadable?

Solved: NoSuchMethodError with google guava/collections starting embedded cassandra service

2011-09-06 Thread David Hawthorne
I ran into this problem today. It's common enough that it shows up in google, but not common enough to have a documented resolution, so here's one. [junit] com.google.common.collect.ImmutableSet.copyOf(Ljava/util/Collection;)Lcom/google/common/collect/ImmutableSet; [junit]

Re: UnavailableException while storing with EACH_QUORUM and RF=3

2011-09-06 Thread Anthony Ikeda
Thanks Jonathan. On Tue, Sep 6, 2011 at 3:53 PM, Jonathan Ellis jbel...@gmail.com wrote: It's linked from the vote thread: http://mail-archives.apache.org/mod_mbox/cassandra-dev/201109.mbox/%3ccakkz8q12k2o7zm5uy9hxnk7kyesqidwcyxbq_uzfna+yaty...@mail.gmail.com%3E On Tue, Sep 6, 2011 at 5:41

create ks failure with latest github source

2011-09-06 Thread Yang
I pulled the latest code from github 3e77792d31344be0253c89355c1d96ffe03c0659 and used my old commands to create a regular KS, and it failed. quick debugging shows that the client hits an NPE somewhere. am I missing some new mandatory args ? Thanks Yang [default@unknown] connect

Re: create ks failure with latest github source

2011-09-06 Thread Yang
ok, it seems that the '[ ]' should not be there now On Tue, Sep 6, 2011 at 4:56 PM, Yang tedd...@gmail.com wrote: I pulled the latest code from github 3e77792d31344be0253c89355c1d96ffe03c0659 and used my old commands to create a regular KS, and it failed. quick debugging shows that

Cassandra 0.8.4 - doesn't support defining keyspaces in cassandra.yaml?

2011-09-06 Thread Roshan Dawrani
Hi, I have just started the process of upgrading Cassandra from 0.7.2 to 0.8.4, and I am facing some issues with embedded cassandra that we utilize in our application. With 0.7.2, we define our keyspace in cassandra.yaml and use Hector to give us an embedded cassandra instance loaded with schema