Re: Install Cassandra on EC2

2011-08-03 Thread Dave Viner
Hi Eldad, Check out http://wiki.apache.org/cassandra/CloudConfig There are a few ways listed there including a step-by-step guide. Dave Viner On Wed, Aug 3, 2011 at 7:49 AM, Eldad Yamin elda...@gmail.com wrote: Thanks! But I prefer to learn how to Install first - if you have any good

Re: LB scenario

2011-04-05 Thread Dave Viner
overloaded. In practice, I've found it better for the client to have a pool of connections, and then retry as needed to distinct nodes rather than use a load balancer. HTH Dave Viner On Tue, Apr 5, 2011 at 9:51 AM, A J s5a...@gmail.com wrote: Can someone comment on this ? Or is the question

Re: what kind of bug?

2011-03-23 Thread Dave Viner
I saw this once when my servers ran out of file descriptors. This caused totally weird problems. Make sure all nodes in the cluster are listening on the gossip port (7000 by default). Also check out http://www.datastax.com/docs/0.7/troubleshooting/index#view-of-ring-differs-between-some-nodesor

Re: EC2 - 2 regions

2011-03-18 Thread Dave Viner
in a separate region. Taking it step-by-step will ensure that any issues are specific to the region-to-region communication, rather than intra-zone connectivity or cassandra cluster configuration. Dave Viner On Fri, Mar 18, 2011 at 8:34 AM, A J s5a...@gmail.com wrote: Hello, I am trying

Re: EC2 - 2 regions

2011-03-18 Thread Dave Viner
From the us-west instance, are you able to connect to the us-east instance using telnet on port 7000 and 9160? If not, then you need to open those ports for communication (via your Security Group) Dave Viner On Fri, Mar 18, 2011 at 10:20 AM, A J s5a...@gmail.com wrote: Thats exactly what I am

Re: cassandra as user-profile data store

2011-03-01 Thread Dave Viner
for each user to build a Profile? Are you on 0.7 or 0.6.x? Dave Viner On Tue, Mar 1, 2011 at 1:31 AM, Dave Gardner dave.gard...@visualdna.comwrote: Dave Tyler's answer already covers CFs etc.. We are using Cassandra to store user profile data for exactly the sort of use case you describe. We

Re: Cassandra nodes on EC2 in two different regions not communicating

2011-02-24 Thread Dave Viner
a piece-meal approach would be beneficial here. Dave Viner On Thu, Feb 24, 2011 at 6:11 AM, Daniel van Ham Colchete daniel.colch...@gmail.com wrote: Himanshi, my bad, try this for iptables: # SNAT outgoing connections iptables -t nat -A POSTROUTING -p tcp --dport 7000 -d 175.41.143.192 -j

Re: Cassandra nodes on EC2 in two different regions not communicating

2011-02-23 Thread Dave Viner
Try using the IP address, not the dns name in the cassandra.yaml. If you can telnet from one to the other on port 7000, and both nodes have the other node in their config, it should work. Dave Viner On Wed, Feb 23, 2011 at 1:43 AM, Himanshi Sharma himanshi.sha...@tcs.comwrote: Ya they do

cassandra as user-profile data store

2011-02-23 Thread Dave Viner
be interested in buying a car). I don't have specifics as yet... just some general thoughts. But this feels like a Cassandra type problem. (User profile can have lots of columns per user, but the exact columns might differ from user to user... very scalable, etc) Thanks Dave Viner

Re: Cassandra nodes on EC2 in two different regions not communicating

2011-02-23 Thread Dave Viner
. I would first try with the actual public IP address (not the Elastic IP). Once you get that to work, then shutdown the cluster, change the listen_address to the EIP, boot up and try again. Dave Viner On Wed, Feb 23, 2011 at 8:54 PM, Himanshi Sharma himanshi.sha...@tcs.comwrote: Hey Dave

Re: Cassandra nodes on EC2 in two different regions not communicating

2011-02-23 Thread Dave Viner
in rpc address field, Cassandra gives the same exception but if leave it blank then Cassandra runs but again in the nodetool command with ring option it does'nt show the node in another region. Thanks, Himanshi -Dave Viner wrote: - To: user@cassandra.apache.org From: Dave Viner

quick shout-out to the riptano/datastax folks!

2011-02-02 Thread Dave Viner
Just a quick shout-out to the riptano folks and becoming part of/forming DataStax! Congrats!

Re: Upgrading from 0.6 to 0.7.0

2011-01-21 Thread Dave Viner
be fantastic. Dave Viner On Fri, Jan 21, 2011 at 1:01 PM, Aaron Morton aa...@thelastpickle.comwrote: Yup, you can use diff ports and you can give them different cluster names and different seed lists. After you upgrade the second cluster partition the data should repair across, either via RR

Re: Cassandra automatic startup script on ubuntu

2011-01-20 Thread Dave Viner
You can also use the apt-get repository version, which installs the startup script. On http://wiki.apache.org/cassandra/CloudConfig, see the Cassandra Basic Setup section. It applies to any debian based machine, not just cloud instances. HTH Dave Viner On Thu, Jan 20, 2011 at 9:11 AM, Donal

Re: Do you have a site in production environment with Cassandra? What client do you use?

2011-01-15 Thread Dave Viner
Perl using the thrift interface directly. On Sat, Jan 15, 2011 at 6:10 AM, Daniel Lundin d...@eintr.org wrote: python + pycassa scala + Hector On Fri, Jan 14, 2011 at 6:24 PM, Ertio Lew ertio...@gmail.com wrote: Hey, If you have a site in production environment or considering so, what

anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Dave Viner
? - tell me how many page views occurred between 12/01/2010 and 12/31/2010 from the US in the 9th hour of the day (in gmt)? Time slicing and dimension slicing seems like it might be very challenging (especially since the windows of time would not be known in advance). Thanks Dave Viner

Re: anyone using Cassandra as an analytics/data warehouse?

2011-01-04 Thread Dave Viner
. Then look at the current row every hour to actually compile the numbers, and store the count in the same Column Family LogCF[hour-day-in-epoch-seconds][total] = x where 'x' is the sum of the number of timeuuid columns in the row? Is that what you're envisioning in Option A? Thanks Dave Viner

Re: Does Cassandra run better on Amazon EC2 or Rackspace cloud servers?

2011-01-03 Thread Dave Viner
Since it's all pay-for-use, you could build your system on both, then do whatever stress testing you want. The cassandra part of your app should be unchanged between different cloud providers. Personally, I'm using EC2 and don't have any complaints. Dave Viner On Mon, Jan 3, 2011 at 3:49 PM

Re: Virtual IP / hardware load balancing for cassandra nodes

2010-12-20 Thread Dave Viner
You can put a Cassandra cluster behind a load balancer. One thing to be cautious of is the health check. Just because the node is listening on port 9160 doesn't mean that it's healthy to serve requests. It is required, but not sufficient. The real test is the JMX values. Dave Viner On Mon

Re: Cassandra Monitoring

2010-12-19 Thread Dave Viner
How does mx4j compare with the earlier jmx-to-rest bridge listed in the operations page: JMX-to-REST bridge available at http://code.google.com/p/polarrose-jmx-rest-bridge; Thanks Dave Viner On Sun, Dec 19, 2010 at 7:01 AM, Ran Tavory ran...@gmail.com wrote: FYI, I just added an mx4j section

Re: Cassandra Monitoring

2010-12-19 Thread Dave Viner
start Cassandra jmx-to-rest runs in a separate jvm. It also has a nice useful HTML interface that you can look into any running host. On Sunday, December 19, 2010, Dave Viner davevi...@gmail.com wrote: How does mx4j compare with the earlier jmx-to-rest bridge listed in the operations

Re: Facebook messaging and choice of HBase over Cassandra - what can we learn?

2010-11-21 Thread Dave Viner
and I'm sure he could make HBase bend to his will at any point. Dave Viner On Sun, Nov 21, 2010 at 4:16 PM, Todd Lipcon t...@lipcon.org wrote: On Sun, Nov 21, 2010 at 2:06 PM, Edward Ribeiro edward.ribe...@gmail.comwrote: Also I believe saying HBASE is consistent is not true. This can happen

Re: Cold boot performance problems

2010-10-08 Thread Dave Viner
Has anyone found solid step-by-step docs on how to raid0 the ephemeral disks in ec2 for use by Cassandra? On Fri, Oct 8, 2010 at 12:11 PM, Jason Horman jhor...@gmail.com wrote: We are currently using EBS with 4 volumes striped with LVM. Wow, we didn't realize you could raid the ephemeral

Re: Advice on settings

2010-10-07 Thread Dave Viner
not pay inbound-outbound fees for the data xfer. HTH, Dave Viner On Thu, Oct 7, 2010 at 10:26 AM, B. Todd Burruss bburr...@real.com wrote: if you are updating columns quite rapidly, you will scatter the columns over many sstables as you update them over time. this means that a read

Re: Dazed and confused with Cassandra on EC2 ...

2010-09-17 Thread Dave Viner
that I understand what you're asking about (because I have similar ec2-related thoughts). Let me know if this is an accurate summary. Dave Viner On Fri, Sep 17, 2010 at 7:41 AM, Jedd Rashbrooke jedd.rashbro...@imagini.net wrote: Howdi, I've just landed in an experiment to get Cassandra

Re: Monitoring with Cacti

2010-09-12 Thread Dave Viner
- https://support.cloudkick.com/Cassandra_Checks. Dave Viner On Fri, Sep 10, 2010 at 8:31 PM, Edward Capriolo edlinuxg...@gmail.comwrote: On Fri, Sep 10, 2010 at 7:29 PM, aaron morton aa...@thelastpickle.com wrote: Am going through the rather painful process of trying to monitor cassandra using

Re: Cassandra HAProxy

2010-08-30 Thread Dave Viner
the rotation. I'd be happy to help with this, as I know how it works with haproxy and standard web servers or other tcp servers. But, I'm not sure how to make it work with Cassandra, since, as Ben points out, it can return valid tcp responses (that say error-condition) on the standard port. Dave Viner

Re: Thrift + PHP: help!

2010-08-19 Thread Dave Viner
I am a user of the perl api - so I'd like to lurk in case there are things that can benefit both perl php. Dave Viner On Wed, Aug 18, 2010 at 1:35 PM, Gabriel Sosa sosagabr...@gmail.com wrote: I would like to help with this too! On Wed, Aug 18, 2010 at 5:15 PM, Bas Kok bakot...@gmail.com

Re: error using get_range_slice with random partitioner

2010-08-06 Thread Dave Viner
; undef $protocol; undef $transport; undef $socket; } HTH Dave Viner On Fri, Aug 6, 2010 at 7:45 AM, Adam Crain adam.cr...@greenenergycorp.comwrote: Thomas, That was indeed the source of the problem. I naively assumed that the token range would help me avoid retrieving duplicate rows

Re: Please need help with Munin: Cassandra Munin plugin problem

2010-07-29 Thread Dave Viner
Is your code posted somewhere such that others could try it? On Thu, Jul 29, 2010 at 5:57 AM, Miriam Allalouf miriam.allal...@gmail.comwrote: Hi, Please, can someone help us with Munin?? Thanks, Miriam On Mon, Jul 26, 2010 at 1:58 PM, osishkin osishkin osish...@gmail.com wrote: Hi,

iterating over all rows keys gets duplicate key returns

2010-07-28 Thread Dave Viner
the keys. Any suggestions on how I can properly iterate? Thanks Dave Viner

Re: iterating over all rows keys gets duplicate key returns

2010-07-28 Thread Dave Viner
an iteration returns no new keys, assume that all keys have been seen and exit. - this also fails, since a particular result set can be full of duplicates, but the iteration has not traversed the entire row-key spectrum. Dave Viner On Wed, Jul 28, 2010 at 3:48 PM, Rob Coli rc...@digg.com wrote

Re: Quick Poll: Server names

2010-07-27 Thread Dave Viner
I've seen used several... names of children of employees of the company names of streets near office names of diseases (lead to very hard to spell names after a while, but was quite educational for most developers) names of characters from famous books (e.g., lord of the rings, asimov novels,

Re: non blocking Cassandra with Tornado

2010-07-27 Thread Dave Viner
the new async Thrift client in Java for inspiration: http://blog.rapleaf.com/dev/2010/06/23/fully-async-thrift-client-in-java/ Or, even better, port the Thrift async client to work for python and other languages. Dave Viner On Tue, Jul 27, 2010 at 8:44 AM, Peter Schuller peter.schul...@infidyne.com

Re: SV: How to stop cassandra server, installed from debian/ubuntupackage

2010-07-26 Thread Dave Viner
Yes... if you're using debian cassandra you can do: /etc/init.d/cassandra stop On Mon, Jul 26, 2010 at 8:05 AM, Lee Parker l...@socialagency.com wrote: Which debian/ubuntu packages are you using? I am using the ones that are maintained by Eric Evans and the init.d script stops the server

Re: Design questions/Schema help

2010-07-26 Thread Dave Viner
AFAIK, atomic increments are not available. There recently has been quite a bit of discussion about them. So, you might search the archives. Dave Viner On Mon, Jul 26, 2010 at 7:02 PM, Mark static.void@gmail.com wrote: On 7/26/10 6:06 PM, Dave Viner wrote: I'd love to hear other's

Re: Cassandra Chef recipe and EC2 snitch

2010-07-22 Thread Dave Viner
. You can also use rackaware-ness in EC2. Just add in the PropertyFile endpoint and put your rack file in /etc/cassandra/rack.properties. Dave Viner On Thu, Jul 22, 2010 at 10:08 AM, Allan Carroll alla...@gmail.com wrote: Hi all, I'm setting up a new cluster on EC2 for the first time

Re: Suggestion for the storage.conf

2010-07-19 Thread Dave Viner
Added: http://wiki.apache.org/cassandra/StorageConfiguration On Mon, Jul 19, 2010 at 2:55 AM, Dimitry Lvovsky dimi...@reviewpro.comwrote: I think it would be a good idea to add a bit more explanation storage-conf.xml/wiki regarding the replication factor. It caused some confusion until we

Re: Cassandra benchmarking on Rackspace Cloud

2010-07-19 Thread Dave Viner
, but perhaps they have something similar? This feels like the kind of problem that might be easier for someone else to setup and quickly test. (The beauty of the virtual server - quick setup and quick tear down) Dave Viner On Mon, Jul 19, 2010 at 10:24 AM, Peter Schuller peter.schul...@infidyne.com

Re: A very short summary on Cassandra for a book

2010-07-15 Thread Dave Viner
universal. Cassandra is eventually consistent. So atomic increment doesn't really work in the eventual world. But, more precisely, one can add, update, change, modify, delete rows, columns, and values at any time from any node. Also sorting happens on insert time Yes, I believe this is true. Dave

Re: Elastic Load Balancing Cassandra

2010-07-13 Thread Dave Viner
I haven't used ELB, but I've setup HAProxy to do it... appears to work well so far. Dave Viner On Tue, Jul 13, 2010 at 3:30 PM, Brian Helfrich helfrich9...@gmail.comwrote: Hi, has anyone been able to load balance a Cassandra cluster with an AWS Elastic Load Balancer? I've setup an ELB

Re: Is anyone using version 0.7 schema update API

2010-07-13 Thread Dave Viner
choosing. According to http://wiki.apache.org/thrift/, Thrift has generators for C++, C#, Erlang, Haskell, Java, Objective C/Cocoa, OCaml, Perl, PHP, Python, Ruby, and Squeak HTH Dave Viner On Tue, Jul 13, 2010 at 6:05 PM, GH gavan.h...@gmail.com wrote: To be honest I do not know how to regenerate

Re: How to add a new Keyspace?

2010-07-08 Thread Dave Viner
://www.mail-archive.com/user@cassandra.apache.org/msg02498.html HTH Dave Viner On Wed, Jul 7, 2010 at 11:39 PM, Peter Schuller peter.schul...@infidyne.com wrote: If I want to add a new Keyspace, does it mean I have to distribute my storage-conf.xml to whole nodes? and restart whole nodes? I

Re: Property file snitch for Cassandra?

2010-07-07 Thread Dave Viner
not yet proven to myself that this is accurate, but it definitely stops the error messages and , from looking at the code, seems like it should work. Is this correct? Thanks Dave Viner On Wed, Jul 7, 2010 at 6:22 PM, Eric Evans eev...@rackspace.com wrote: Let's move this to the user@ list

Backing up the data stored in cassandra

2010-07-07 Thread Dave Viner
? (That is, bring up a new node, copy the backed up files from the crashed node onto the new node, then have the new node join the cluster?) Thanks Dave Viner