Re: Cassandra users survey

2009-11-25 Thread time
2) a practical/situational view of managing a cassandra cluster ... it would be nice to have a more comprehensive deployment guide. You're right. Maybe we can get Digg to share theirs. :) We don't have any such thing. The deployment at Digg is just as alpha as the deployment anywhere

Re: Cassandra users survey

2009-11-25 Thread matthew hawthorne
On Wed, Nov 25, 2009 at 6:38 PM, time t...@digg.com wrote: If I query a key range M..N, what nodes would likely answer? I also would enjoy this feature. For my evaluation I was attempting to verify that data was being replicated to the proper nodes. I was able to do this superficially by

Re: Cassandra users survey

2009-11-24 Thread Ted Zlatanov
On Mon, 23 Nov 2009 23:30:51 -0500 Matt Revelle mreve...@gmail.com wrote: MR Are you both using timestamps as row keys? Would be great to hear MR more details. I'm using super column keys in a super column. So let's say your resource is routerA. Your data will be: Row routerA SuperColumn

Re: Cassandra users survey

2009-11-24 Thread Rich Atkinson
We at Platform46 are building an on-premise enterprise appliance which provides twitter-like, open-follower, short messaging services for internal corporate networks. We are using Cassandra, Python and RabbitMQ to help us build a scalable solution where appliances may be configured as true peers

Re: Cassandra users survey

2009-11-24 Thread matthew hawthorne
I work for Comcast, and we have tons of data that we are migrating into non-relational storage. we recently evaluated cassandra, riak, voldemort, and hdfs. I focused on cassandra, this is why you may have seen me asking dumb questions over IRC :-) A few desirables for cassandra: 1) I'm not a

Re: Cassandra users survey

2009-11-23 Thread Ted Zlatanov
On Fri, 20 Nov 2009 17:38:39 -0800 Dan Di Spaltro dan.dispal...@gmail.com wrote: DDS At Cloudkick we are using Cassandra to store monitoring statistics and DDS running analytics over the data. I would love to share some ideas DDS about how we set up our data-model, if anyone is interested.

Re: Cassandra users survey

2009-11-23 Thread Jonathan Ellis
Thanks for the replies, everyone. A couple people have suggested that we put this on the wiki to replace the old, never-updated PoweredBy page (maybe as UsersSurvey09). I'll pull the _public_ responses only into a wiki page later this week; if you replied to the list but don't want to be on the

Re: Cassandra users survey

2009-11-23 Thread Matt Revelle
On Nov 23, 2009, at 12:27, Ted Zlatanov t...@lifelogs.com wrote: On Fri, 20 Nov 2009 17:38:39 -0800 Dan Di Spaltro dan.dispal...@gmail.com wrote: DDS At Cloudkick we are using Cassandra to store monitoring statistics and DDS running analytics over the data. I would love to share some

Re: Cassandra users survey

2009-11-21 Thread Phillip Michalak
We're using Cassandra in development to store custom index information on large document sets. Also considered HBase and Voldemort. Cassandra's data model and performance tradeoffs seemed to best fit our needs. Features that we're looking forward to seeing: * map/reduce integration *

Re: Cassandra users survey

2009-11-21 Thread Chris Were
Hi Jonathan, Firstly, thanks for all your help on this list. Without lots of your solutions / tips etc I probably wouldn't be using Cassandra. I've built a real-time search engine based around all the links that appear on twitter (http://www.mozzler.com/). Cassandra is my data store for all the

Re: Cassandra users survey

2009-11-21 Thread Richard Grossman
Hello, I'm working at bee.tv. We build actually a large application related to smart TV and movies recommendations. We've developed an application wich was only based on Oracle + Java server side but it was not enough scalable solution. Of course Oracle have capabilities to scale but you need to

Re: Cassandra users survey

2009-11-20 Thread Joseph Bowman
Hi Jonathon, I'd say I am at the evaluation stage. The only reason I am looking at nosql type applications instead of using mysql is the vain hope my application will one day scale to the point that mysql won't be the best option. Cassandra appears to be the best fit for the requirement I have

Re: Cassandra users survey

2009-11-20 Thread Erich Nachbar
Hi, I'm using Cassandra 0.4.2 at my current client to persist URL graphs for Spam detection. The crawling and page classification is done in Hadoop/Bixo/Cascading, which persists URL classification results into Cassandra. The incoming production traffic is using Cassandra for the real-time spam

Re: Cassandra users survey

2009-11-20 Thread Simon Smith
The company I'm with is still small and in the early stages, but we're planning on using Cassandra for user profile information (in development right now), and possibly other uses later on. We evaluated CouchDB and Voldermort, and both of those were great as well - for CouchDB, I really liked

Re: Cassandra users survey

2009-11-20 Thread B. Todd Burruss
I am evaluating NoSQL alternatives to your typical hard to scale RDBMS, specifically Key/Value stores. I'm not looking for query capabilities. I want very very very high availability with very very large amounts of data. I have reduced down my list to Cassandra, Voldemort, Riak, and CouchDB.

Re: Cassandra users survey

2009-11-20 Thread Ramzi Rabah
We are currently evaluating Cassandra, and using it for a small feature in production. We are only using the basic insert/get/remove from the API, with a standard column family. So far, I like a lot of what Cassandra offers, though I had some tough times with it. * Version 0.4.2 seems very

Re: Cassandra users survey

2009-11-20 Thread Ryan King
At twitter we're working on using Cassandra to replace our currents storage for all tweets. We have a cluster in production that's being populated outside the the user-critical path (ie, the cassandra writing is async). Additionally, we're testing and evaluating for basically everything else in

Re: Cassandra users survey

2009-11-20 Thread Ian Holsman
We're looking at it to be part of a near real time Web analytics engine, which sounds similar to Ooyala. at the moment I'm pushing to get the thing open sourced if possible. we're looking at combining Cassandra + Esper, but we are still in the very early stages. On Nov 21, 2009, at 8:17 AM,

Re: Cassandra users survey

2009-11-20 Thread scott w
For a project I am working on now at Onespot we are just beginning to move off RDBMS and onto Cassandra for a subset of our data store. We evaluated against several other solutions including Tokyo, Voldemort and Riak and Cassandra seemed the clear winner for our requirements. We have also done

Re: Cassandra users survey

2009-11-20 Thread Jake Luciani
I'm about to release a twitter search engine built ontop of cassandra. If you are interested in beta testing it let me know. I would like to see cassandra support increment/decrement. -Jake On Fri, Nov 20, 2009 at 4:17 PM, Jonathan Ellis jbel...@gmail.com wrote: Hi all, I'd love to get a

Re: Cassandra users survey

2009-11-20 Thread James Golick
I would love to see that post about your data model. J. Sent from my iPhone. On 2009-11-20, at 5:38 PM, Dan Di Spaltro dan.dispal...@gmail.com wrote: At Cloudkick we are using Cassandra to store monitoring statistics and running analytics over the data. I would love to share some ideas

Re: Cassandra users survey

2009-11-20 Thread Ian Holsman
--- Sent from my phone Ian Holsman - 703 879-3128 On 21/11/2009, at 12:38 PM, Dan Di Spaltro dan.dispal...@gmail.com wrote: At Cloudkick we are using Cassandra to store monitoring statistics and running analytics over the data. I would love to share some ideas about how we set up our