2) a practical/situational view of managing a cassandra cluster
...
it would be nice to have a more comprehensive deployment guide.
You're right. Maybe we can get Digg to share theirs. :)
We don't have any such thing. The deployment at Digg is just as alpha as
the deployment anywhere
On Wed, Nov 25, 2009 at 6:38 PM, time t...@digg.com wrote:
If I query a key range M..N, what nodes would likely answer?
I also would enjoy this feature. For my evaluation I was attempting
to verify that data was being replicated to the proper nodes. I was
able to do this superficially by
On Mon, 23 Nov 2009 23:30:51 -0500 Matt Revelle mreve...@gmail.com wrote:
MR Are you both using timestamps as row keys? Would be great to hear
MR more details.
I'm using super column keys in a super column.
So let's say your resource is routerA.
Your data will be:
Row routerA
SuperColumn
We at Platform46 are building an on-premise enterprise appliance which
provides twitter-like, open-follower, short messaging services for
internal corporate networks.
We are using Cassandra, Python and RabbitMQ to help us build a
scalable solution where appliances may be configured as true peers
I work for Comcast, and we have tons of data that we are migrating
into non-relational storage.
we recently evaluated cassandra, riak, voldemort, and hdfs. I focused
on cassandra, this is why you may have seen me asking dumb questions
over IRC :-)
A few desirables for cassandra:
1) I'm not a
On Fri, 20 Nov 2009 17:38:39 -0800 Dan Di Spaltro dan.dispal...@gmail.com
wrote:
DDS At Cloudkick we are using Cassandra to store monitoring statistics and
DDS running analytics over the data. I would love to share some ideas
DDS about how we set up our data-model, if anyone is interested.
Thanks for the replies, everyone.
A couple people have suggested that we put this on the
wiki to replace the old, never-updated PoweredBy page (maybe as
UsersSurvey09). I'll pull the _public_ responses only into a wiki
page later this week; if you replied to the list but don't want to be
on the
On Nov 23, 2009, at 12:27, Ted Zlatanov t...@lifelogs.com wrote:
On Fri, 20 Nov 2009 17:38:39 -0800 Dan Di Spaltro dan.dispal...@gmail.com
wrote:
DDS At Cloudkick we are using Cassandra to store monitoring
statistics and
DDS running analytics over the data. I would love to share some
We're using Cassandra in development to store custom index information
on large document sets. Also considered HBase and Voldemort.
Cassandra's data model and performance tradeoffs seemed to best fit
our needs.
Features that we're looking forward to seeing:
* map/reduce integration
*
Hi Jonathan,
Firstly, thanks for all your help on this list. Without lots of your
solutions / tips etc I probably wouldn't be using Cassandra.
I've built a real-time search engine based around all the links that appear
on twitter (http://www.mozzler.com/). Cassandra is my data store for all the
Hello, I'm working at bee.tv. We build actually a large application related
to smart TV and movies recommendations.
We've developed an application wich was only based on Oracle + Java server
side but it was not enough scalable solution. Of course Oracle have
capabilities to scale but you need to
Hi Jonathon,
I'd say I am at the evaluation stage. The only reason I am looking at nosql
type applications instead of using mysql is the vain hope my application
will one day scale to the point that mysql won't be the best option.
Cassandra appears to be the best fit for the requirement I have
Hi,
I'm using Cassandra 0.4.2 at my current client to persist URL graphs
for Spam detection.
The crawling and page classification is done in Hadoop/Bixo/Cascading,
which persists URL classification results into Cassandra.
The incoming production traffic is using Cassandra for the real-time
spam
The company I'm with is still small and in the early stages, but we're
planning on using Cassandra for user profile information (in
development right now), and possibly other uses later on. We
evaluated CouchDB and Voldermort, and both of those were great as well
- for CouchDB, I really liked
I am evaluating NoSQL alternatives to your typical hard to scale
RDBMS, specifically Key/Value stores. I'm not looking for query
capabilities. I want very very very high availability with very very
large amounts of data.
I have reduced down my list to Cassandra, Voldemort, Riak, and CouchDB.
We are currently evaluating Cassandra, and using it for a small
feature in production. We are only using the basic insert/get/remove
from the API, with a standard column family. So far, I like a lot of
what Cassandra offers, though I had some tough times with it.
* Version 0.4.2 seems very
At twitter we're working on using Cassandra to replace our currents
storage for all tweets. We have a cluster in production that's being
populated outside the the user-critical path (ie, the cassandra
writing is async).
Additionally, we're testing and evaluating for basically everything
else in
We're looking at it to be part of a near real time Web analytics engine, which
sounds similar to Ooyala.
at the moment I'm pushing to get the thing open sourced if possible.
we're looking at combining Cassandra + Esper, but we are still in the very
early stages.
On Nov 21, 2009, at 8:17 AM,
For a project I am working on now at Onespot we are just beginning to move
off RDBMS and onto Cassandra for a subset of our data store. We evaluated
against several other solutions including Tokyo, Voldemort and Riak and
Cassandra seemed the clear winner for our requirements. We have also done
I'm about to release a twitter search engine built ontop of cassandra. If
you are interested in beta testing it let me know.
I would like to see cassandra support increment/decrement.
-Jake
On Fri, Nov 20, 2009 at 4:17 PM, Jonathan Ellis jbel...@gmail.com wrote:
Hi all,
I'd love to get a
I would love to see that post about your data model.
J.
Sent from my iPhone.
On 2009-11-20, at 5:38 PM, Dan Di Spaltro dan.dispal...@gmail.com
wrote:
At Cloudkick we are using Cassandra to store monitoring statistics and
running analytics over the data. I would love to share some ideas
---
Sent from my phone
Ian Holsman - 703 879-3128
On 21/11/2009, at 12:38 PM, Dan Di Spaltro dan.dispal...@gmail.com
wrote:
At Cloudkick we are using Cassandra to store monitoring statistics and
running analytics over the data. I would love to share some ideas
about how we set up our
22 matches
Mail list logo