Re: V2 Beta 1 bootstrap behaviour ?

2013-07-16 Thread Andrew Cobley
Righto ! Will do a JIRA report.. Many thanks Andy On 16 Jul 2013, at 18:50, Robert Coli mailto:rc...@eventbrite.com>> wrote: On Tue, Jul 16, 2013 at 1:16 AM, Andrew Cobley mailto:a.e.cob...@dundee.ac.uk>> wrote: I'm setting up a new test cluster using 2.0.0-beta1

Re: Deletion use more space.

2013-07-16 Thread Andrew Bialecki
I don't think setting gc_grace_seconds to an hour is going to do what you'd expect. After gc_grace_seconds, if you haven't run a repair within that hour, the data you deleted will seem to have been undeleted. Someone correct me if I'm wrong, but in order to order to completely delete data and rega

Re: node tool ring displays 33.33% owns on 3 node cluster with replication

2013-07-12 Thread Andrew Bialecki
nclude the keyspace as part of your query, you'll get it weighted by the RF of that keyspace. I believe the same logic applies for nodetool status. Andrew On Thu, Jul 11, 2013 at 12:58 PM, Jason Tyler wrote: > Thanks Rob! I was able to confirm with getendpoints. > > Cheers

Re: Lots of replicate on write tasks pending, want to investigate

2013-07-03 Thread Andrew Bialecki
(not replicate on write) for counters involves a read, so that explains the high disk I/O, but for that I'd expect to see many write requests pending (which we see a bit), but not replicate on writes backing up. What am I missing? Andrew On Wed, Jul 3, 2013 at 1:03 PM, Robert Coli wrote:

Lots of replicate on write tasks pending, want to investigate

2013-07-03 Thread Andrew Bialecki
2. I'm assuming in our case the cause is incrementing counters because disk reads are part of the write path for counters and are not for appending columns to a row. Does that logic make sense? Thanks in advance, Andrew

Re: How to do a CAS UPDATE on single column CF?

2013-07-01 Thread Andrew Cobley
According to Jonathan Ellis talk at Cassandra 13 it does use Paxos: http://www.youtube.com/watch?v=PcUpPR4nSr4&list=PLqcm6qE9lgKJzVvwHprow9h7KMpb5hcUU http://www.slideshare.net/jbellis/cassandra-summit-2013-keynote Andy On 1 Jul 2013, at 19:40, Francisco Andrades Grassi mailto:bigjoc...@gmail.

Re: Counter value becomes incorrect after several dozen reads & writes

2013-06-25 Thread Andrew Bialecki
f the behavior can be duplicated. Andrew On Tue, Jun 25, 2013 at 2:18 PM, Robert Coli wrote: > On Mon, Jun 24, 2013 at 6:42 PM, Josh Dzielak wrote: > > There is only 1 thread running this sequence, and consistency levels are > set > > to ALL. The behavior is fairly repeatable - the

Updated sstable size for LCS, ran upgradesstables, file sizes didn't change

2013-06-21 Thread Andrew Bialecki
ance with varying sstable sizes. Thanks in advance! Andrew

Re: Does replicate_on_write=true imply that CL.QUORUM for reads is unnecessary?

2013-06-02 Thread Andrew Bialecki
e on write should normally always be turned on, or the change >> will only be recorded on one node. Replicate on write is asynchronous >> with respect to the request and doesn't affect consistency level at >> all. >> >> >> On Wed, May 29, 2013 at 7:32 P

Re: Does replicate_on_write=true imply that CL.QUORUM for reads is unnecessary?

2013-05-29 Thread Andrew Bialecki
ess of what you actually set it to (and for good reason). On Wed, May 29, 2013 at 9:47 AM, Andrew Bialecki wrote: > Quick question about counter columns. In looking at the replicate_on_write > setting, assuming you go with the default of "true", my understanding is it > writes th

Does replicate_on_write=true imply that CL.QUORUM for reads is unnecessary?

2013-05-29 Thread Andrew Bialecki
Quick question about counter columns. In looking at the replicate_on_write setting, assuming you go with the default of "true", my understanding is it writes the increment to all replicas on any increment. If that's the case, doesn't that mean there's no point in using CL.QUORUM for reads because

Re: Observation on shuffling vs adding/removing nodes

2013-03-24 Thread Andrew Bialecki
- > Aaron Morton > Freelance Cassandra Consultant > New Zealand > > @aaronmorton > http://www.thelastpickle.com > > On 24/03/2013, at 9:41 AM, Andrew Bialecki > wrote: > > Just curious if anyone has any thoughts on something we've observed in a

Observation on shuffling vs adding/removing nodes

2013-03-23 Thread Andrew Bialecki
ng and shutting down nodes extremely easy. Cheers, Andrew

Bootstrapping a node in 1.2.2

2013-03-19 Thread Andrew Bialecki
I've got a 3 node cluster in 1.2.2 and just bootstrapped a new node into it. For each of the existing nodes, I had num tokens set to 256 and for the new node I also had it set to 256, however after bootstrapping into the cluster, "nodetool status " for my main keyspace which has RF=2 now reports:

Re: How to configure linux service for Cassandra

2013-03-19 Thread Andrew Cobley
Does this help you ? https://github.com/acobley/CassandraStartup it was built for Raspbian, but might help you. Andy On 19 Mar 2013, at 11:10, Roshan mailto:codeva...@gmail.com>> wrote: Hi I want to start the cassandra as a service. At the moment it is starting as a background task. Cassand

Re: Nodetool drain automatically shutting down node?

2013-03-08 Thread Andrew Bialecki
forceFlush requested but everything is clean in Standard1 INFO [RMI TCP Connection(2)-10.116.111.143] 2013-03-09 03:54:33,510 StorageService.java (line 774) DRAINED On Fri, Mar 8, 2013 at 10:36 PM, Andrew Bialecki wrote: > Hey all, > > We're getting ready to upgrade our cluster to 1.

Cassandra as a service on Windows

2013-02-14 Thread Andrew Cobley
Hi all,\ According to http://www.datastax.com/dev/blog/whats-new-in-cassandra-1-0-windows-service-new-cql-clients-and-more running cassandra.bat install should make cassandra run on a service on a windows box. However I'm getting the following when I try: C:\apache-cassandra-1.2.1\bin>cassandr

Re: Running Cassandra 1.1, how can I see the efficiency of the key cache?

2012-12-22 Thread Andrew Bialecki
Thanks, I'll take a look at that too. I also found that "nodetool info" gives some information as well. For instance, here's what one node reads: Key Cache: size 104857584 (bytes), capacity 104857584 (bytes), 15085408 hits, 17336031 requests, 0.870 recent hit rate, 14400 save period in seconds.

Need to run nodetool repair on a cluster running 1.1.6 if no deletes

2012-12-22 Thread Andrew Bialecki
letes? Anything drawbacks to not running it? Thanks, Andrew

Running Cassandra 1.1, how can I see the efficiency of the key cache?

2012-12-22 Thread Andrew Bialecki
nged the defaults there, so the key cache setting in cassandra.yaml is still blank. Thanks for any help and happy holidays, Andrew

Re: Simulating a failed node

2012-10-29 Thread Andrew Bialecki
Thanks, extremely helpful. The key bit was I wasn't flushing the old Keyspace before re-running the stress test, so I was stuck at RF = 1 from a previous run despite passing RF = 2 to the stress tool. On Sun, Oct 28, 2012 at 2:49 AM, Peter Schuller wrote: > > Operation [158320] retried 10 times

Re: Simulating a failed node

2012-10-27 Thread Andrew Bialecki
RF and CL are you using? > > > On 2012/10/28, at 13:13, Andrew Bialecki > wrote: > > Hey everyone, > > I'm trying to simulate what happens when a node goes down to make sure my > cluster can gracefully handle node failures. For my setup I have a 3 node > cluster runni

Simulating a failed node

2012-10-27 Thread Andrew Bialecki
writes failing even after a retry? It might be the stress client doesn't pool connections (I took a quick look, but might've not looked deeply enough), however I also tried only specifying the first two server nodes and then downing the third with the same failure. Thanks in advance. Andrew

unsubscribe

2012-07-30 Thread Andrew Knox
unsubscribe

Unstable cassandra - , Cannot allocate memory

2010-12-15 Thread Dr. Andrew Perella
ot allocate memory at java.lang.UNIXProcess.(UNIXProcess.java:148) at java.lang.ProcessImpl.start(ProcessImpl.java:65) at java.lang.ProcessBuilder.start(ProcessBuilder.java:453) ... 12 more Dr. Andrew Perella CTO / Chief Software Architect Eutechnyx Limited. Metro Centre

RE: Newbie question on Cassandra mem usage

2010-11-26 Thread Dr. Andrew Perella
Sorry - just realised this is now a parameter on the CFDef From: Dr. Andrew Perella [mailto:a...@eutechnyx.com] Sent: 26 November 2010 14:17 To: user@cassandra.apache.org Subject: RE: Newbie question on Cassandra mem usage How can I set these per CF when I create them dynamically? Regards

RE: Newbie question on Cassandra mem usage

2010-11-26 Thread Dr. Andrew Perella
How can I set these per CF when I create them dynamically? Regards, Andrew From: Aaron Morton [mailto:aa...@thelastpickle.com] Sent: 22 November 2010 21:40 To: user@cassandra.apache.org Subject: Re: Newbie question on Cassandra mem usage They are memtable_throughput_in_mb

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-18 Thread Dr. Andrew Perella
Actually it turns out there is a submitted patch for this already from January 2009 but it was never accepted due to a complaint about an unnecessary cast! I have modified and resubmitted the patch. -Original Message- From: Dr. Andrew Perella [mailto:a...@eutechnyx.com] Sent: 18

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-18 Thread Dr. Andrew Perella
project? On Thu, Nov 18, 2010 at 2:13 PM, Dr. Andrew Perella wrote: > UPDATE:  Solved > > > > After digging deeper I realised that I had patched fastbinary incorrectly > for compiling under visual studio on windows and was left with the wrong > endian define. > > As the app of

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-18 Thread Dr. Andrew Perella
. Regards, Andrew From: Tyler Hobbs [mailto:ty...@riptano.com] Sent: 18 November 2010 00:00 To: user@cassandra.apache.org Subject: Re: casssandra 0.7beta3, pycassa and windows client timestamps No, there's no client side caching. Assuming you're using insert(), could you capture both of the

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-18 Thread Dr. Andrew Perella
et it to go wrong. Thankyou for all your help, I am sure this issue will return so I will hopefully come back with more info. Is there something in the log that lets us know the insert has worked? ie the timestamp check has passed? Regards, Andrew From: Tyler Hobbs [mailto:ty...@riptan

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-17 Thread Dr. Andrew Perella
lient timestamps Well, the write appears to be succeeding, and since you say this works on Linux, it sounds like a client side problem. Are you using pycassa or are you dealing with raw Thrift. If pycassa, what version? - Tyler On Wed, Nov 17, 2010 at 5:02 PM, Dr. Andrew Perella mai

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-17 Thread Dr. Andrew Perella
22:59:42,937 quorumResponseHandler: 0 ms. From: Tyler Hobbs [mailto:ty...@riptano.com] Sent: 17 November 2010 22:56 To: user@cassandra.apache.org Subject: Re: casssandra 0.7beta3, pycassa and windows client timestamps Can you turn on debug logging on Cassandra (change INFO to debug in c

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-17 Thread Dr. Andrew Perella
ultinode cluster or just one instance of Cassandra? - Tyler On Wed, Nov 17, 2010 at 4:30 PM, Dr. Andrew Perella mailto:a...@eutechnyx.com>> wrote: Miliseconds - but that's not the problem - I can wait 10 minutes and still not get the value updated! Other times I can update many t

RE: casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-17 Thread Dr. Andrew Perella
t: Re: casssandra 0.7beta3, pycassa and windows client timestamps What resolution does time.time() give you in Windows? - Tyler On Wed, Nov 17, 2010 at 4:23 PM, Dr. Andrew Perella mailto:a...@eutechnyx.com>> wrote: I have encountered a strange problem with values not being written to Cassandra ver

casssandra 0.7beta3, pycassa and windows client timestamps

2010-11-17 Thread Dr. Andrew Perella
away. Has anyone seen anything like this or will I need to delve deeper into thift? FYI I am using pythons int(time.time()*1e6) as a timestamp Best Regards, Andrew This e-mail is confidential and may be privileged. It may be read, copied and used only by the intended recipient. No communication

RE: Cassandra - python communication problem

2010-11-09 Thread Dr. Andrew Perella
Nov 9, 2010 at 12:07 PM, Dr. Andrew Perella mailto:a...@eutechnyx.com>> wrote: > Thanks Jonathan, > So it looks like the python bindings are using a deprecated non framed > protocol. Leaving that problem aside for a moment I changed the server to > work in non framed mode an

RE: Cassandra - python communication problem

2010-11-09 Thread Dr. Andrew Perella
create one? I had assumed Cassandra would commonly be used from python so would be able to jump in and use it. Cheers, Andrew -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent: 09 November 2010 18:20 To: user Subject: Re: Cassandra - python communication problem

RE: Cassandra - python communication problem

2010-11-09 Thread Dr. Andrew Perella
nown result"); thrift.Thrift.TApplicationException: describe_keyspace failed: unknown result Any ideas? Cheers, Andrew -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent: 09 November 2010 17:56 To: user Subject: Re: Cassandra - python communication problem You're

Cassandra - python communication problem

2010-11-09 Thread Dr. Andrew Perella
0 bytes I get similar errors for all queries. I cannot work out what is going wrong here. Can anyone help? (I need many of the functions only added in 0.7) Best Regards, Andrew This e-mail is confidential and may be privileged. It may be read, copied and used only by the intended recipient. N

unsubscribe

2010-10-26 Thread Andrew
unsubscribe

Re: What is K-table ?

2010-09-01 Thread Andrew Garman
yaw gmail.com> writes: > > Hi all, connecting to a cluster with cassandra-cli and trying a > describe command, > I obtain a "missing K_TABLE" message > Is this a real issue? I would chock this one up to a mix of user error and cryptic CLI message. describe command is describe keyspace

Re: EC2 small instance config?

2010-07-07 Thread Andrew Rollins
MySQL, so my next thing would be to try RAID with Cassandra. Also, another theory is that CommitLogSync in batch mode might allow me to effectively rate limit writing so that I don't overflow memory. Thoughts? - Andrew

EC2 small instance config?

2010-07-06 Thread Andrew Rollins
s of the data, which means less data to move after node failure). - Andrew

Re: Write assurance in Cassandra

2010-07-04 Thread Andrew Rollins
Is your IO under heavy load? If it is, that may be the cause, otherwise I'm not sure what causes significant lag. On Linux I like to use "iostat -tx 10" to check IO. - Andrew On Sun, Jul 4, 2010 at 4:04 AM, David Boxenhorn wrote: > Thank you very much! I now understand

Re: Write assurance in Cassandra

2010-07-04 Thread Andrew Rollins
. At first glance it seems like concepts should be independent of one another. - Andrew On Sun, Jul 4, 2010 at 3:34 AM, David Boxenhorn wrote: > As I understand it, when you write to Cassandra, you are assured that, if > successful, the new data has been written to a log file - so that

Distributed work-queues?

2010-06-26 Thread Andrew Miklas
ID into the row and then immediately read it back to confirm that they are the owner of the job. Any ideas here? Has anyone come up with a nice implementation? Is Cassandra not well suited for queue-like tasks? Thanks, Andrew

Cassandra Health Monitoring

2010-06-22 Thread Andrew Psaltis
is greatly appreciated. Thanks, Andrew

Re: how does cassandra compare with mongodb?

2010-05-13 Thread philip andrew
MongoDB encourages you to define your schema in your application code by using mapping classes. This logically infers that it makes no sense to define the schema twice, in the database and in your application code. On Fri, May 14, 2010 at 3:48 AM, Steve Lihn wrote: > What is changing? A more fle

Re: how does cassandra compare with mongodb?

2010-05-12 Thread philip andrew
Hi, >From my understanding, Cassandra entities are indexed on only one key, so this can be a problem if you are searching for example by two values such as if you are storing an entity with a x,y then wish to search for entities in a box ie x>5 and x<10 and y>5 and y<10. MongoDB can do this, Cassa

Re: Is SuperColumn necessary?

2010-05-06 Thread philip andrew
Please create a new term word if the existing terms are misleading, if its not a file system then its not good to call it a file system. On Thu, May 6, 2010 at 3:50 PM, Torsten Curdt wrote: > +1 on all of that > > On Thu, May 6, 2010 at 09:04, David Boxenhorn wrote: > > That would be a good tim

Re: Appropriate use for Cassandra?

2010-05-05 Thread philip andrew
http://www.youtube.com/watch?v=eaCCkfjPm0o 3.30 song begins 4.00 starfish loves you and Cassandra loves you! On Thu, May 6, 2010 at 11:03 AM, Denis Haskin wrote: > i can haz hints pleez? > > On Wed, May 5, 2010 at 9:28 PM, philip andrew > wrote: > > Starfish loves you. >

Re: Appropriate use for Cassandra?

2010-05-05 Thread philip andrew
Starfish loves you. On Wed, May 5, 2010 at 1:16 PM, David Strauss wrote: > On 2010-05-05 04:50, Denis Haskin wrote: > > I've been reading everything I can get my hands on about Cassandra and > > it sounds like a possibly very good framework for our data needs; I'm > > about to take the plunge and

Re: Cassandra data model for financial data

2010-04-29 Thread Andrew Nguyen
When making rough calculations regarding the potential size of a single row, what sort of overhead is there to consider? In other words, for a particular column, what else is there to consider in terms of memory consumption besides the value itself? On Apr 29, 2010, at 8:49 AM, Mark Jones wrot

Re: Cassandra data model for financial data

2010-04-29 Thread Andrew Nguyen
What is the upper limit on the number of super columns? Is it pretty much the same as for columns in general? On Apr 28, 2010, at 10:09 PM, Schubert Zhang wrote: > key : stock ID, e.g. AAPL+year > column family: closting price and valume, tow CFs. > colum name: timestamp LongType > > AAPL+201

Best way to store millisecond-accurate data

2010-04-23 Thread Andrew Nguyen
and the best way to model this, we would be adding 3.9 billion rows per year (assuming 125 Hz @ 24/7). However, I can't really think of a better way to model this... So, am I thinking about this all wrong or am I on the right track? Thanks, Andrew

How to model 2D data in Cassandra?

2010-04-17 Thread philip andrew
Hi, Lets say I wanted to store 2 dimensional data in the database, each object has a X and Y location in a very large space. I want to query Cassandra for all objects within a rectangle. My understanding is that my objects can only be indexed by one key, one key for each single object in my tabl

<    1   2